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Abstract 

It is well known that a generic small perturbation of a Liouville-integrable Hamiltonian system causes 
breakup of resonant and near-resonant invariant tori. A general approach to the simple resonance case 
in the convex real-analytic setting is developed, based on a new technique for solving the Hamilton- 
Jacobi equation. It is shown that a generic perturbation creates in the core of a resonance a partially 
hyperbolic lower-dimensional invariant torus, whose Lagrangian stable and unstable manifolds, described 
as global solutions of the Hamilton-Jacobi equation, split away from this torus at exponentially small 
angles. Optimal upper bounds with best constants are obtained for exponentially small splitting in the 
general case. 
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1 Introduction 



The notion of Arnold diffusion refers to a generic instability of Hamiltonian systems with three and higher 
degrees of freedom ^ [3] . A notable exception are Liouville-integrable systems allowing the construction of 
global action- angle variables [5]. Small perturbations of such systems provide a natural set-up to study the 
instability. Recently Mather [21] using methods of analysis in the large (which to a great extent had been 
created by himself) |22l l2"H] announced the proof of the existence of Arnold diffusion in the three degrees of 
freedom (convex, real-analytic) case. 

Consider a Hamiltonian system of n + 1, n > 2 degrees of freedom in the cotangent bundle T*T n+1 = 



W i+i x jn+i of a torus T n+i = (]R/27rZ) n+1 . Take an open convex domain n C R n+1 . The phase space 
Ai = fl x f n+1 has a natural exact symplectic structure tr>. Consider a Hamiltonian function 



Suppose H is real-analytic, i.e. it can be extended holomorphically into a neighborhood of M. in C 2n+2 , let 
| • \m above be the supremum-norm. Also suppose that H is strictly convex. Then one can simply take 
Q = {p e W l+1 : H {p) < E } for some E > 1. 

If (Pi q) are (global) canonical coordinates on Ai, or the action-angle variables j^j with to = dq A dp, the 
Hamiltonian has an expression 



The Hessian matrix D 2 Hq(p) is positive definite for every peO;0<e<lisa small parameter. The 
perturbation Hi is 27r-periodic in each angle q.j, j = 0, . . . ,n. The system Q is autonomous 1 . If e = 0, it 
is Liouvillc intcgrablc. Its phase space is foliated by invariant tori, whereupon p(t) = const., and q(t) = 
const. + u)(p)t : where w(p) = DHq(p) is a frequency. Each torus is a Lagrangian manifold. 

The central question of local analysis of system is what geometric objects replace the invariant tori 
when e ^ 0. The KAM theorem ^|^2E3EH] asserts that as e — ► 0+, an asymptotically full measure set 
of these tori is stable. 

Resonant unperturbed tori are foliated by tori of lower dimension. The property of a torus being resonant 
or non-resonant is clearly intrinsic, as well as the notion of the multiplicity m of a resonance, i.e. the difference 
in the dimensions of the original resonant torus and the minimum foliation torus. In the above coordinate 
representation the resonances correspond to the values of the action p, when the components of the frequency 
vector oj(p) € R™ +1 are linearly dependent over the integers Z, m being the dimension of the kernel of a 
linear map u)[k] = (fc,o;), for k £ Z n+1 . Resonant tori as well as the non-resonant ones sufficiently close to 
the former, typically get destroyed for any e ^ 0. The set of destroyed tori is residual on the unperturbed 
energy surface Hq 1 {E) for a regular value of E. It is known [3^1 that given a specific resonance, the majority 
(in the sense of the Lebesgue measure in R n+1 ~ m ) of the corresponding resonant tori result in particular in 
the appearance of partially hyperbolic, or whiskered tori of dimension less by m — 1, . . . ,n. Characteristic 
exponents of these tori are typically 0(y/e). Singular perturbation theory for manifolds asymptotic to these 
tori has a number of subtleties [HI 13 1141 1151 l.'itil which would not be there, were the above characteristic 
exponents 0(1), see also [71 IfH ITT] . 

Non-resonant tori, sufficiently close to resonances experience a complicated topological perestroika. If n = 
1, the result is a cantorus |22| supporting an invariant action- minimizing measure |23j . Higher-dimensional 
relatives of cantori are not so well understood, unless the local analysis can be in a sense reduced to the n — 1 
case H2J. 

The purpose of this paper is to develop from scratch the local theory for a simple resonance, m = 1. 
A resonance is identified by an integer lattice point feo G Z' i+1 \ {0}. An unperturbed torus, marked 
by p = p e Q is resonant with respect to ko iff the corresponding frequency uj = <jj{p ) lies on the 

1 The case when Hi depends on time periodically can be treated in the usual way whereupon the convexity assumption 
about Ho should be substituted by quasi-convexity |19l and the non- degeneracy assumption in Theorem|2]- by isoenergetic non- 
degeneracy Convexity is far the easiest non-degeneracy assumption to deal with; for more subtle non-degeneracy settings in 
the KAM theory see e.g. 1351 . 



H : M^R, such that \H - H \ M = e < 1, for some H : Q — *■ K. 



(1) 



H = H(p,q,e) = H (p)+eH 1 (p,q,e), (p, q, e) e x T" +1 x K+. 



(!') 
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"resonant hyperplane" {u> € IR™ +1 : (ko,ui) = 0}. As Hq is smooth and strictly convex, the "frequency 
map" p — > u>(p) is a global diffeomorphism. The values of the action p satisfying the above resonance 
condition lie on a smooth hypersurface in R n+1 , which intersects each regular level set of Ho transversely 
(for otherwise (k , D 2 H (p )k ) = 0) and is a graph over the hyperplane (k ,p) = 0. Thus metric and 
topological properties of sets on the resonant hypersurface can be described in terms of their images in the 
resonant hyperplane, via the frequency map. 

Given ko, one chooses a value p on the intersection of a regular level set of Hq with the resonance 
hypersurface, such that the corresponding frequency ljq is non-resonant over Z™ +1 modulo one-dimensional 
sub-lattice generated by kg. If one denotes the corresponding unperturbed simple resonance (n+ l)-torus as 
To, the latter is foliated by a one-parameter family of n-tori, which can be parameterized by some i£T: 



It is assumed that Wo is "far enough" from higher multiplicity resonances. To express the latter property, 
Kolmogorov's Diophantine condition J7J over the quotient lattice is used. The set of all such frequencies ljq 
has a positive Lebesgue measure on the resonance hyperplane 

Study of simple resonances and their role in global dynamics for the general system Q had begun at 
least as early as Poincare [23 - Arnold pQ used a simple resonance model to suggest a local mechanism for 
universal instability, or diffusion, based on the existence of intersections of Lagrangian manifolds, asymptotic 
to whiskered tori, alias the splitting of separatrices phenomenon. Splitting in a more general context was 
studied by Chirikov (Kj emphasizing its role in the general diffusion scenario and conjecturing a number of 
generic asymptotic exponentially small bounds apropos of the splitting and the diffusion speed. For the 
latter, the theorem of Nekhoroshev [2H1E1I2EI gives the upper bound ~ cxp ^e^ 2 ""™ 1 ) J . 

More recently models for simple resonances and splitting of separatrices have been investigated in a great 
number of works, see EJ EI El HH HI IZ3 among others. For a more extensive bibliography list see the 
treatise [23 by Lochak et al, to which one can add some 30 more titles which have become available since the 
year 2000. The latter work [23 among other things develops a normal form theory for local near-resonance 
dynamics, see also ^5J[2E])- However, the underlying multiple step averaging procedure is rather general 
and does not allow to study the splitting in all the detail. As an alternative Lochak et al advocate the 
Hamilton- Jacobi method, which they illustrate for a particular Hamiltonian from the Arnold example pQ 
(also published separately as |37p and draft formulations of a number of theorems, which are proved herein. 

A fundamental question apropos of exponentially small splitting (to which the Nckhoroshev-like normal 
form theory fails to provide an answer) is one of the best constants for the upper estimates involved. Such 
constants have been obtained for various cuts of a specific model, coupling a pendulum-like one degree of 
freedom Hamiltonian system with a bunch of rotators [71 1141 ITUlll5H36l l^|. Ideally, the upper bounds would be 
supported by lower bounds, which constitute a very delicate issue and are available only for a few particular 
examples ^3 EH1 El 12] ■ The issue is not addressed in this paper. 

An important result concerning the splitting problem in the general simple resonance context is due to 
Eliasson |13) (see also [5]) who proved the estimate 2n + 2 on the minimum number of homoclinic orbits to a 
whiskered torus of dimension n at the resonance core, but not the exponentially small splitting estimate. The 
main building blocks for the splitting theory near a simple resonance are presented in |21) although many are 
without proofs, apparently due to a variety of technical difficulties. This paper attempts to do it, as it turns 
out that most of these difficulties can be bypassed owing to a technique, rather different from those used 
in the above listed references (for the exception of [37]) and appears to be more "natural" for the problem 
involved. The technique certainly applies to the above mentioned model, for which it gives the (known) best 
constant J and also shows that the latter is the largest value that the best constant in question can assume 
in principle. 

In essence, our technique is the Hamilton-Jacobi approach prompted by Poincare |27j cast as a "hyperbolic 
KAM theorem" . However it is developed in an entirely different geometric context than the traditional one 
founded by Graff JS]- The present geometric scenario was founded in [31]; this paper shapes it into "KAM 
theory on semi-infinite bi-cylinders over tori" . 




(2) 
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The paper is organized as follows. Section 2 starts out with the preliminaries in order to describe the standard 
normal form near a chosen simple resonance (|2.6[1 and Lemma 12. II A non-degeneracy Assumption ^ is made 
concerning the "hyperbolic part" of the truncated (integrable) normal form, whereupon the splitting problem 
is set up somewhat heuristically for the localization (|2.14|) of the normal form Hamiltonian near the truncated 
normal form separatrix. The set-up emphasizes what is called a "sputnik" property (|2.16l) thereof, combining 
the 27r-periodicity of the foliation (J2J and reversibility of the truncation (|2.7|) of the normal form Hamiltonian 
(|2.6[) . At that point one of the main results, Theorem ^ of the paper is formulated. The formulation is still 
somewhat heuristic, due to the necessity of developing a certain amount of machinery. 

Section 3 develops this machinery, underlying the aforesaid version of KAM theory. It is based on a simple 
holomorphic map introducing "energy-time" coordinates l|3.1l I3.5|> in which the base space is not compact. 
Section ETT1 is almost entirely dedicated to the relevant formalism. The hyperbolic KAM theorem, Theorem 
follows, providing global generating functions for perturbed separatrices as the solutions of the Hamilton- 
Jacobi equation, Corollary 12.11 The proof of Theorem [21 incorporates a two-parameter trick, yielding an 
optimal (in the sense of the parameter dependence) smallness condition (|3.33() . 

Section 4 presents the theory for the splitting, based on application of Theorem [21 and a global sputnik 
property, Assumption [3 The role of the sputnik is to ensure that the one- form giving the splitting distance 
be exact, which allows one immediately to give a lower bound for a number of homoclinic orbits TS*] . This 
is inherent in the homoclinic splitting problems, being expressed by relations 1)2.161 12.221 14. 4|) . The main 
theorem of the section, Theorem[31claims the principal exponentially small estimate (|4.13|) which is adapted as 
(|2.19() to the simple resonance normal form in Theorem^ concluding its proof. The estimate l|2.19[) contains 
a pair of best constants (/?, 02), well defined for a specific Hamiltonian with a given analyticity domain, see 



2 Normal form near separatrix at a simple resonance 

This section prepares the Hamiltonian Q for the set-up of the theory developed in the sequel. It consists in 
choosing a simple resonance action value and restricting the Hamiltonian to its small neighborhood, where 
a suitable normal form can be produced. Under generic assumptions, this normal form can be viewed as a 
perturbation of an integrable reversible system containing a separatrix. These steps are standard, see e.g. 
[391 IT3"| . However the further analysis is essentially different from that one traditionally encounters in the 
literature. The Hamiltonian gets localized near one branch of the separatrix of the truncated normal form 
Hamiltonian. Localization near the other branch can be seen from the former one as a symmetry, referred to 
as a sputnik. 

2.1 Preliminaries 

Canonical transformations of the phase space 

Throughout the paper, a number of canonical transformations is introduced. These transformations belong 
to an "affine" class, corresponding to the phase space bundle structure. 

For the system the phase space is a subset of T*T n+1 , with canonical coordinates (p, q) £ R™ +1 x T n+1 . 
An automorphism a of the base space induces a family of canonical transformations 



parameterized by a closed one-form dS on the base space. As the latter is T n+1 , the one-form dS is described 
by the generating function S(q) = (£,q) + S(q), with some £ € R n+1 = [dS] specifying the fl^T^M) 
cohomology class of dS, and some function S(q), which is a zero-form on T" + , i.e. is 27r-periodic in each 
component qj, j = 0, . . . , n of q. 

The notation (, ) stands for the canonical coupling between (and later R n ) and its dual space and is 

identified with the Euclidean scalar product; l (c?a) 1 denotes the transpose inverse of the Jacobi matrix da. 



fT7ll3~31) . 




(2.1) 
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The case when the map a is linear, so a and da can be identified with a matrix from SL(n + 1,Z) is 
referred to as a symplectic rotation. The one-form dS effects a shift of the origin within each fiber. This shift 
is fiber-independent if S(q) = (p , q), given p S R" +1 , which then becomes the origin in each fiber. 

Simple resonance Diophantine condition 

A resonance is identified by a minimal integer lattice point k G Z n+1 \ {0}. I.e. there is no fc S Z n+1 , such 

that k Q = jk, j = 2,3, Then in any lattice basis e , . . . , e n , the components of fc are relatively prime. 

The choice of the basis {ej}j=o...., n determines a coordinate chart (jp, q): take qj £ ~Rej /2ir'Zej , j = 0, . . . ,n, 
let pj be a momentum canonically conjugate to qj. 

Consider a one-dimensional lattice Zfco and a direct sum decomposition Z' i+1 = Zfco© Z ra+1 /Zfco. Choose 
a lattice basis in the quotient lattice Z n+1 /Zfco and let fci, . . . , k n £ Z™ +1 represent it in Z n+1 , so {fcj}j=o,...,n 
is a lattice basis in Z" + . For a moment, let us call it a direct sum decomposition basis, generated by feo. 
One can expand each kj over the "old" basis {ei}i=o,... y n- As in the first equation in (|2.1I) let us write it as 
k = a(e), where a linear operator a is identified with a matrix from SL(n + 1, Z). The first row of this matrix 
simply gives the coordinates of feo in the basis {ej}j=o,...,n't the rest of the rows depend on a particular choice 
of the basis in the quotient lattice Z' i+1 /Zfe . Clearly fc defines a direct sum decomposition basis modulo 
SL(n,Z). 

Definition 1 A vector u) £ R™ +1 is Diophantine modulo fco with an exponent t > n — 1 and a constant 7 > 
(one writes lj £ W^i y 1 (ko)) if (feo,u>) = and there exists a direct sum decomposition basis {fcj}j=o,...,n 
generated by feo, such that for all fc £ Z™ +1 ; represented as k = (feo, . . . , k n ) g Z™ +1 in this basis, one has 

n 

\(k,u)\ > 7 \k\7\ where |fc| 6 = ^ \kj\. (2.2) 



Note that given fc 6 Z n+1 and a; G M n+1 , the "small divisor" (fc,a;) does not depend on the choice of the 
lattice basis {ej}j=o,..., n - However, the quantity |fc|g in the right hand side of l|2.2l) is not SL(n, Z)-invariant 
and can attain any positive integer value for a given k. 

Definition ^ is rather unwieldy, and in order to describe metric properties of Diophantine vectors one is 
forced to fix the lattice basis {ej}j=o,...,n- As the lattice element fco is considered fixed throughout the paper, 
one should naturally render {e.j}j=o,..., n and {fej}j=o,...,n the same basis. Then fco — (1,0, . . . , 0) and for any 
w such that (fco,w) = 0, clearly lj = (Q,lo) for some 10 € M n . Given the pair (r, 7), let us further use the 
notation 2U™ 7 for a set of ui £ W 1 , satisfying a stronger definition than Definition ^ 

2U?, 7 = jo; G R" : Vfe€Z"\{0}, | (k, u) \ > ~f\k\-\ Ife^^lfe^j. (2.3) 

The above is the standard Kolmogorov Diophantine condition 17 . Some well known metric properties of 
the set 22J™ (with a sufficiently small 7, see e.g. j3U10], also ^2] f° r complementary results) are that it is 
non-empty if r > n — 1, and for r > n — 1 has a full Lebesgue measure as 7 — > 0. 

2.2 Localization near a simple resonance 

For the Hamiltonian given a resonance fco, let p £ Q lie on a regular level set Hq 1 (E) and u)q = DHo(p ) 
satisfy Definition^ Then one can choose the lattice basis {ej}j=a,...,n with eo = fco, such that ujq = (0,wo), 
with uiq £ 2n" 7 , with some r, which is fixed throughout the paper and 7 = 70- Moreover, the origin for the 
action variables can be set at p Q , i.e. p = (0, . . . , 0). As H is defined modulo a constant, let E — H o (0) = 0. 
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This fixes the choice of the action-angle variables (p, q) and denoting p = (y,I) G K x K", q = (x, ip) S 
TxT", one can write down the following representation for the Hamiltonian 

H(p,q } e) = (u ,I) + ^(Q p,p) + O 3 {p) + E Hi(p,q,e), (2.4) 

where Qq is a constant positive definite matrix. 
Notation: 

N.l. The set of non-negative or positive, integer or real numbers is denoted as Z + or Z ++ , R + or R++, 
respectively. 

N.2. Bold lowercase symbols usually denote (n + l)-vector quantities. Uppercase symbols often but not 
always denote n-vector quantities. E.g. above p = (y,/), q — (x,ip); further g(q) = (g(q),G(q)). 

\ u ( z .)| 

N.3. The symbol notation u(z, ■) = O a (z; •), with ael, implies that lim — - — jj — , where II • II is the Euclidean 

z^o \\z\\ a 

norm, exists and is uniformly bounded from above for the whole range of the variables (•) (which may 
be omitted in the notation, as well as a = 1) by some constant C which may depend on (n, r) and 
perhaps other quantities fundamental for the problem, to be specified. C will be "as large as necessary" 
and may increase without notice. To suppress C (or C~ l ) in estimates, the < sign is often used instead 
of <. 

N.4. For real k, a > and j e Z ++ (j = 1 usually being omitted) let 

W K = {z e : ||z|| < 4, 

Ti = f {z E : ftz E V, \Qz\ < a} 

be complex extensions of a disk and a torus. 

It will always be assumed by default that n > 1, and it will not enter the estimates. All the analyticity 
and non-degeneracy parameters are by default positive, as well as e. In addition, if 5 for instance 
denotes the analyticity loss in the variable <p G T", it will be assumed by default that < 5 < a. 

N.5. Scalar functions 27r-periodic in each variable, holomorphic and uniformly bounded inside T£, whose 
restrictions on V are real-analytic form Banach spaces ^^(T 5 ), with topology induced by the supremum 
norm | • \ a . The space of all Taylor series with coefficients in 5^ (TP), uniformly convergent inside B£, 
with the supremum norm | • | K cr is denoted as Sk^TT). Referring to real-analytic functions on 
complex domain in the sequel means referring to their holomorphic extensions. 

The same notation stands for the supremum norms of vector functions. At places the subscripts in 
the norm notation can be omitted. If a function, whose norm is evaluated depends on additional 
parameters, omitting these dependencies in the estimates implies their uniformity. 

Thus, in the convex real-analytic set-up, there exists a set of parameters {70, kq, oo, Ro, Mo, 6q}, with kq > 1 
and £o^l, such that the Hamiltonian l|2.4|l satisfies the following. 

Model statement: 

1. For allee [Q,e ], H(p,q,e) e B„ , ff0 (T*T»+ 1 ) and \H \ Ko < M Q , \H x \ Ko ^ < 1. 

2. The frequency luq € 2H™ 7o . 

3. The constant matrix Qq is positive definite, HQo^H < ^o^ 1 - 

2 If feo / (1,0,..., 0) in the given lattice basis and the Hamiltonian H is given in the form 11' I . the representation 12.41 can 
certainly be achieved by means of a canonical transformation in the form 12.11 combining a symplectic rotation and a shift of 
the action origin to p 1411391 . With a specific H and fco in mind, the adaptation of the parameters in the main estimate 12.191 
in Theorem is straightforward, similar to how it will have embraced the parameter 9 in the sequel. 
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For a specific Hamiltonian 0/J the analyticity considerations may somehow single out the choice of the 
original coordinates (p, q), see e.g. JU]. As a result, the parameters above as well as the bounding constants 
may also depend on fco. 

2.3 Normal form near a resonance 

It is well known that one can come up with a normal form near a resonance |1| . Such a normal form for the 
Hamiltonian (|2.4[) was used as a motivation for the results of among others. Note that the normal 

form transformation belongs to the class 112. Ill and cannot be iterated in this form. 

As kq > 1, one can accept Mq and 1 respectively as bounds for several orders of derivatives of Hq and 
Hi in p. Assuming Ro < 1, let us get rid of this parameter as far as the quadratic part of Ho is concerned. 
Rewrite (12 .4|) scaling the time and actions by factor y/e/Ro, i.e. p — > y/e/Rop, H —* y/Ro/s H, and then 
divide the Hamiltonian by \feR~o (tantamount to yet another time scaling). Then 12.411 changes to 

H{p,q,e) = + ~{Qip,p) +e- 1 O 3 (y/e/R p) + H 1 ( x /^/R G ~p,q,e), 

with the notations 

uj-l = —==, Qi — R 1 Qo- 
V £ Ro 

The analyticity domain of the scaled Hamiltonian H in the scaled action p is now a complex ball of radius 
0(y/Ro/e). The matrix Qi is such that its spectrum is contained in [1, C(n)M /R ]. 

Decompose the quantity H\(Q,q,e) = H\(0, x, tp, e) into a 9?-mean U(x,e) and an oscillatory part: 

Hi(0,x,ip,s) = J Tn Hi(0, x,ip,e)d(p + (Hi(0,x, ip, e) - J T „ Hi(0,x, <p,e)d<p) 

= U(x,e) + {H^x^e)}. 

To get rid of the ^-oscillatory term {i?i(0, x, if, e)} consider a canonical transformation H^, which is tan- 
tamount to the shift p — > p + dS„(q,e). The 1-form dS^ is exact and is given by a 27r-periodic in each 
component of q = [x, tp) function S v , satisfying a PDE 

D Ul S v (x,ip,e) = -{JJi(0,a;, (p,e)}, 

with the general notation for u> S 1™ 

D u ^{u,D v ). (2.5) 

Note that x, e enter the above equation as parameters, in particular SV is defined modulo a function of x. The 
solution of this equation exists in a somewhat larger space than that for the right hand side. The following 
result is well known 31 . 

Proposition 2.3.1 Let to £ 2U™ 7 . For a function v £ Q3 CT (T™) with zero average on T n , the solution of the 
equation D^u — v exists in the space 5B CT '(T") for any a' < a. If a — a 1 = 5, s = r )8 T , then 

He < r>U \du\ a , < issrM*- 



Then given e small enough to ensure that the transformation ^„ be near identity, i.e. \dS v \ *C 1, suppressing 
e in the notation one gets for H v = H oE u : 

H v (p,q) = <wi,i)+ \{QiP,p) +U{x) + [/i(«) + <ffi(p,g),p>], (2.6) 

where 

< MoR^ldS^, \ Bl \ ai < M sup(R^^dS v \ ai ,\J^), 



7 



for some o~\ < do, with g 1 having absorbed the momentum-super-quadratic term. In view of Proposition 
12.3.11 the estimate for IdSVI^ is proportional to 7i<5q +1 , where 5q = ctq — <J\ is the analyticity loss and the 
parameter 71 = 7o(v / ^Ro) _1 characterizes the "fast" frequency toi G 22J",7i m the sense of (I2.3() . 
A characteristic feature of the simple resonance normal form H u is that its truncation 

H v , t (p,q) = (ujuI)+ \{QiP,p) + U{x) (2.7) 

is integrable. The pair (/i,«7i) can thus be treated as a perturbation, which requires it to be sufficiently 
small in comparison with in particular U(x) = 0(1), i.e. 

£<^X[infM ),i?o] 2 , (2.8) 
with the notation ?o = 70^0 ■ The results so far are summarized as follows. 

Lemma 2.1 (Normal form lemma) Let p G ft lie on a regular level set of Hq and for some ko G Z ra+1 , 
letoJo = DHq(p ) be Diophantine modulo (ko)- Suppose, the localization j2.4\ ) of Hamiltonian yj) nearp = p a 
satisfies the Model statement with the frequency ujq £ 1q an ^ ^ e se ^ of parameters {70, Ko, o~o, Mq, £0}. 
For o\ < Co 1 1st So = <Jo — o\ , ?o = 70^0 j suppose 

£o < Mo) 2 - 

For anj/ £ £ (0,£o), the Hamiltonian £JJ) can 6e casi mio i/ie normal form \2. 6)) where: 

1. H v G B„ 1 , (ri (I»T B + 1 ) 1 uritft /si = 0(Ro/V^); 

2. The constant matrix Q\ is positive definite, with HQ^H % 1, ||Qi|| J$ Mq/Ro; 

3. One has 

l/ik < eMoMo)" 2 , IffxU^M < ^(Mo/V^JIinfMo),^]- 1 . (2.9) 

2.4 Localization near separatrix 

Let us take a closer look at the truncated normal form (|2.7|) . The second clause of Lemma 12.11 implies that 
without loss of generality one can assume 

(Qip,p) = y 2 + 2y(e,i) + (ei, i), 

2 

with a constant vector G R™, and a constant positive definite matrix 8 € 1" , whose smallest eigenvalue 
is at least one (a greater than one coefficient multiplying y 2 being favorable). In order to proceed one needs 
the following assumption. 

Assumption 1 (Perturbation of general position) The function U = U(x, e) possesses a unique uni- 
formly non- degenerate absolute maximum on T for all e £ [Q, £0], with a characteristic exponent A G R++- 

Without loss of generality let the maximizer be x — for each e (which as far as the above assumption is 
concerned is non-essential and will be further omitted in the notation) with U(0) — 0. Assumption ^ then is 
tantamount to the claim 

U(0) = 0, £4(0) = 0, U xx (0) = -A 2 , A- 1 = 0(1); 

(2.10) 

Vx c £ T \ {0} : U x (x c ) = 0, U(x c ) < 0. 

For the truncated normal form Hamiltonian (|2.7(l the action / is an integral of motion. For 1 = one can 
single out a one-dimensional natural integrable system, whose Hamiltonian is y 2 /2 + U(x) . This is a reversible 
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Hamiltonian system in T*T = M x T. Near a zero energy level, its phase portrait is reminiscent of the classical 
pendulum, Fig. 1. There is a saddle (x,y) = (0,0) connected to itself by a pair of simple non-contractable 
curves y = ±y- 2 U(x), forming a single oo-shaped curve, further referred to as the separatrix. 

Let U(x) = X 2 Ui(x). Conditions i jSTTty allow one to define a 47r-periodic separatrix function ip(x), 
determined in general as well as the constant A by the pair (fcg, Hi) and possibly depending on e: 



f - ^-2U x {x) , xe[-2n,0), 

V /^R, xe[0,2.) (2 - n) 

(having chosen the branch of the square root where \/T = 1). Thus tp(0) — 0, ip x {0) = 1, V^a^O) = O(l) and 

■0 o Z 2 7r = — "0j ^27r : £ — > a; + 27r. (2-12) 

By (I2.10|) the function ■(/> has no other zeroes on the real axis, but even multiples of tt. For instance in the 
classical pendulum case U(x) — cosx — 1, i])(x) = 2$mx/2. In the general case one can write U\{x) = 
(cosx — l)V(x), with some real-analytic 27r-periodic function V, which has no zeroes on the real axis and 
V(Q) = 1. Therefore tp(x) — 2s'm(x/2)ipi(x), where the function ipi is real-analytic, 27r-periodic, has no zeroes 
on the real axis, and ipi(0) = 1. Thus the function ip is real-analytic and has no zeroes in some neighborhood 
of the real axis, except even multiples of tt. In particular, this property will be valid for \^sx\ < a% < <Ji, for 
some (72- 

In the full phase space T*T™ +1 the separatrix is represented by a Lagrangian manifold 

W t = {(p,q) e M" +1 x T" +1 : p = dS t (q) = (X^(x),0)}, (2.13) 

where the function ip is viewed as a double- valued function on T, corresponding to an exact double- valued one- 
form dSt on T" +1 , given by a (^-independent generating function St(%, f) = X j tp(x)dx, modulo a constant. 
The separatrix forms a coinciding unstable-stable manifold to an invariant torus T t at x = 0, see Fig. 1. 
One can localize (|2.6ll near the manifold W t . Let us make a formal change 3 y — > y + Xip(x) and denote 
the corresponding canonical transformation, acting as the identity on the pair (/, </?). The transformation 
L^ 1 acts on the base space variable x e T = K/27rZ as a period doubling map T — > T', where 

T' = R/4ttZ. 

The transformation changes the phase space to T*(T' x T") and incurs a topological change on the 
separatrix W t , doubling the point (x,y) = (0,0) on its projection on the (x, y)-plane, see the following Fig. 
1. The manifold W t now corresponds to the zero section of the bundle T*(T' xT"), with the identical zero 
generating function. 

Let H,p = H v o H^, now a 47r-periodic function of x: 

H t {p,q) - Xi>(x)y + (u; 1 +\ilj(x)d,I) + ~(Q 1 p,p) + [U(q) + {g i ,(p,q),p)]. (2.14) 

In particular fy, = /i + ipfip, where the function is determined by g 1 in Ij2.6|l . At the first glance after the 
transformation the bound for both and will be that for g 1 in l|2.9|l . However this is not quite the 
case, as one may recall that the generating function S u of the canonical transformation S„ in Lemma 12. H is 
defined modulo a function of x. Thus one can combine the two transformations S„ o into one with the 
generating function S v + X J ip(x)dx, which enables one to improve the above estimates as follows: 

_ (2.15) 
\g-4>U,a < VeMol^Ro inf( ?0 (So,i?o)] _1 . 



3 Note that one should not worry here about the analyticity domains, as n\ in Lcmma ^2.1l is large enough. 
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Remark 2.1: Without loss of generality, A < 1. As far as the power of the parameter Rq is concerned (the 
second entry in the sup and inf above) the estimates depend on whether or not the original unperturbed 
Hamiltonian Hq in JQ) contains super-quadratic terms. 4 

Along with let us denote a canonical transformation, effecting the shift y — > y — \ip(x), corresponding 
to the "lower" separatrix on the phase portrait of the truncated normal form Hamiltonian, Fig. 1. Studying 
the Hamiltonian = H v o further referred to as the sputnik of in essence adds nothing new, 

as is tantamount to followed by a shift Z 27r of the x-variable, in view of 27r-antiperiodicity of the 
function ip(x) and 27r-periodicity in x of the normal form Hamiltonian H v . On the other, the sputnik 
turns out to be a convenient way to describe on the interval x G [27r,47r). Denoting L 27r a canonical 
transformation, corresponding to the extension £ 27r : (x, 93) — s- (x + 27r, ip) of the shift Z 27r in the base space 
(be it T" +1 , T x T" or further R x T"; also let Z_ 27r = l^) one has 

L-^ o L 2 7r = L2tt o L^p and H-^ o L 2?r = 

after applying H u to the first relation. Using the same symbols L±^ for the transformations effecting the 
change y — > ?/ ± Xip(x) on T*(T' x T"), one also has 

-ffiy = Hif, O L-^ = H-jji O Lif,, 

and a useful identity follows: 

— H^o L 2rr ° L% = o L~ 2 o L 2rr . (2.16) 

For the flow of the Hamiltonian H^ x = H uX o on T*(T' xP), the invariant manifold W t contains a 
pair of invariant whiskered tori T uX and T Stt such x = on the former torus and x — 2n on the latter one. 
W t is the unstable manifold for T Myt and the stable manifold for T s>t . The two tori can be identified via the 
transformation L^ 1 . 

The transformation not only "doubles" the base space but also the separatrix. Indeed, the two 
branches thereof in the truncated normal form H v % are not only graphs over the base space, but also one 
over the other. Thus the tori T uX and T Stt not only possess an unstable/stable manifold respectively, which 
is W t , the zero section of T*(T x T"), but also its sputnik 

Wt = {(p,q) € R n+1 x (T' x T") : p = -2dS t (q) = (-2X^(x),0)}, (2.17) 

which is the stable manifold for T u t and the unstable manifold for T s t . This fact is in essence reflected by 
l|2.16|) : this is the symmetry in the Hamiltonian which enables one to identify 5 the manifolds W t and 
W[. On the other hand, W{. is clearly a flow-invariant zero section for the Hamiltonian -ff-^.,t = H„ x o L_^. 
Further a sputnik Hamiltonian will be marked by a prime, e.g. H-^p — H'^. 

Splitting problem 

As the perturbation in l|2.14J) is not identically zero (more precisely the zero order term thereof in the Taylor 
expansion in p) the manifold W t no longer lies inside the energy surface of H^, (for which it is the zero 
section). One can expect the following scenario, Fig. 1. 

A perturbation of general position causes the manifold W t to bifurcate, or split into a pair of distinct 
Lagrangian manifolds, denoted as W u and W s . If the perturbation is small enough, the two manifolds can 
be described as graphs of closed one-forms dS u and dS s . The forms dS UiS are well defined over the cylinders 

4 The perturbation in 12.61 is evaluated not only at y = 0, but also quite far away from it on the separatrix. The characteristic 
size of the separatrix in the original action variables of the Hamiltonian 12.41 is A ~ J sRq 1 , and in order that the super-quadratic 
term be considered as a perturbation of the (quadratic in momenta) resonant normal form, one should have e >> A 3 , thus 
e <C Rfy It is nevertheless irrelevant apropos of the estimates, regarding the preservation of the invariant torus at X = 0, in 
particular because i/>(0) = (one would have to look still closer at the structure of the acquired term to see that). 

5 By looking locally at the Hamiltonian vector field generated by H^, an observer won't be able to tell whether it is applied 
at a point (y, I, x, ip) or (y — 2\ip(x), I,x + 2tt, ip). Thus they won't be able to tell Wt and VK t ' as well as t u ,t and T s ,t apart. 
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I U:S x T™ respectively, where T u = [— 2ir + r, 2tv — r], for some r < 1 (characteristic of the quantity ip) and 
T s = 1-2tt(Z u ). Each manifold W u , s contains an invariant torus T u ,s, being the unstable manifold for T u and 
the stable one for T s . 

One should be able to identify the tori T u>a via the transformation . This in particular requires that 
both one-forms dS UfS belong to the same cohomology class: 

[dS u ] = [dS s ] = £ G E" = H l {l u . s x T",E). 

This fact is easy to establish due to the fact that the sputnik manifold W[ will split just the same, to which 
there will correspond a pair of closed one- forms dS' u and dS' s . It will be easy to see that say dS u and dS' u 
have the same cohomology class, as they in particular describe the same torus T u . On the other hand (|2.16() 
claims a congruency between the graphs of the forms dS' u and dS s . 

To measure the distance between the two manifolds W u and W s and to study their intersections, one 
naturally uses an exact 1-form d& = dS u — dS s , well defined on the union of two disjoint cylinders (T_ U 
T + ) x T™ where X_ = [— 2ir + r, — r] and T + = — T-. Let (3 G Z2 = {+, — } be the sign of x, then the notation 
d&p stands for the restriction of d& on Xp x T". The values of (3 = +, — respectively correspond to the 
splitting of the "upper" and "lower" separatriccs of the truncated normal form Hamiltonian H v x, see Fig. 
1. The splitting of the sputnik manifold W^, alias the zero section for H' = H-^ is described by — d&^p, by 

The generating function 6 p is real-analytic on Ip x T™, where it satisfies a linear homogeneous Hamilton- 
Jacobi equation, whose coefficients can be made constant via a change of variables. Analyzing the result of 
the latter change, one has the following theorem. 

Theorem 1 Let p Q G lie on a regular level set of Hq and for some fco € Z n+ , let u>o = DHq(p q ) 
be in 2U" 7Q (fco), corresponding to a simple resonance torus To with the foliation ^Sjj in terms of x € T. 
Suppose the localization \2.$ of Hamiltonian near p — p satisfies the Model statement with the frequency 
uiq € 2U7 ,t and parameters {kq, o~o, Rq, Mo, eq}- Suppose the perturbation Hi satisfies Assumption^with the 
characteristic exponent A and the separatrix function ip. For o\ < 00, let 5o = o~o — o\ , <ro = 70^0 ! suppose 

eo < {CMo)- 2 Ro[\ 2 inf(co<Jo, Ro)? = V 2 , (2-18) 

for some large enough C — C(n,T,ko,ao,ip)- 

For any e G (0,£o); continuously if H\{-,e) is continuous, there exists a pair of analytic Lagrangian 
manifolds W u ^ s , intersecting at an invariant n-torus T , on which the flow of 0) is conjugate to a rotation 
by Wo- Each manifold is locally a graph over TxT™. Away from T , the distance & between W u and W s is 
bounded by 

|6| < v/irT 1 E expf-|(fc,p^^+^)|-|fc| CTi y (2.19) 
fceZ "\{Q} V A V£ 1 

TT 

where p < —, 0*2 < &1, and the quantities p,o~2,& are well defined for H. The manifolds W u and W s also 
intersect along at least 2n + 2 orbits, biasymptotic to T. 

Remark 2.2: In view of the smallness condition l|2.18|l and the fact that 9 < Mo in (|2.19|) , the contribution 
of the quantity (72# in the estimate (I2.19f) becomes important for a Diophantine uio when |fc| ~ e~ 2 ( T + r > and 
will play an extra role if one attempts to estimate 1 6 1 from below . Where exactly the quantities p and 
CT2 arise is explained further, see in particular l|3.7l 13.8(1 and Fig. 2. 

It is possible to simplify (|2.14|) further by eliminating the constant 9 G R™ therein, letting 

x = x', y = y'-(9,I), , . 

<p = <p' + 9x', I = /'. [ V> 
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The base space transformation, corresponding to Eg will be denoted as ag. Unless 9 £ Q ra , the pre-image 
of the base space T' x T™ under the transformation ag is a bi-infinite cylinder Ix T". In other words, the 
transformation Eg almost surely results in the loss of 47r-periodicity in the "hyperbolic coordinate" x. 
If Hg = H^p o Eg then 

H e (p,q) = \i>(x)y + {u h ,I} + ^{Q 2 p,p) + [f e (q) + {g e (p,q),p)}, (2.21) 

where the matrix Q2 arises from Qi as a result of Gaussian elimination of off-diagonal elements in the first 
row and the first column. So Q2 is non-degenerate with the determinant at least one and the eigenvalue of 
largest absolute value being bounded in terms of MqRq ■ 

An apparent change of the analyticity domain of Hamiltonian <|2.21|l as far as the variables tp are concerned 
is easy to take into account; this will be done in Section 4. For now let us assume that Hg(-, Lp) is real-analytic 
for tp G T™ for some a. The only inevitable analyticity loss so far has been Sq in the application of Lemma 
12.11 The actions p live in a complex ball around the origin, whose radius is "as large as necessary" , provided 
(|2.8fl is satisfied. 

It is easy to see that 

Eg o L 4 , = L^o Eg, o Eg = Eg o L^g ° L 3 2n , 

where for j £ Z, L 3 2 g is a canonical transformation, corresponding to the base space diffeomorphism ll „ : 
(x, tp) — > (x, tp + 2nj9). Then (|2.16() gets modified to 

Hg=H e o L 2 3 o Lf® o Ll e = He o L~ 2 ^ o L 2 3 o L^ e , (2.22) 

where a(j) == | \ odef 1 ' ^ S ^ 6 P ar ^^ °^ 3' ^ n °^ ncr wol 'ds (|2.22l) reads 

Hg(y,I,x,(p) = Hg(y + 2a(j)\ip(x),I, x-2irj, ip + 2irj9) 

= Hg(y - 2a(j)\ijj(x - 2nj),I, x - 2nj, Lp + 2-njO). 

The manifold W t is now represented by the zero section of the bundle T*(R x T"), which contains unstable 
or stable tori 7} it for respectively even or odd values of j. All the unstable [stable] tori can be identified with 
one another via the transformation Eg 1 . In Fig. 1 the tori 7o. t ,7±,t correspond to j = 0,±1 respectively. 

Clearly, as it was the case with £f„, it suffices still suffices knowing Hg on the interval x £ [0, 2ir) only. 
The relation (|2.22l) applied to the truncated Hamiltonian 

H e , t = \i>{x)y + V - + (u u I) + ^(6J,J) 

2 

X^(x)y+ y - = V -{2X^{x)+y), 

one sees the sputnik manifold W^, where y = —2\ip(x). Naturally this manifold is the zero section for the 
Hamiltonian H' g = o Eg. This will transform the latter expression to 

V 2 

-\iP(x)y+!L-, 

which is tantamount to the shift x — > x + 2tt in view of 27r-antiperiodicity of ijj. Similarly changing y — > 
—2\ip(x) + y in the perturbation in l|2.21|l is tantamount to changing x — > x + 2ir and tp — > tp — 2ir9. 

It is convenient to treat Hg as a multi- valued real- analytic function on T*(T' x T"), whose branch is 
specified by fixing an even value of j in (|2.22() and j = suffices for consideration. The branches differ by the 
shift of the angle tp by an integer multiple of Att9. The splitting problem is well-posed for a chosen branch 
of Hg and the magnitude of splitting is clearly the same on each branch. Technically, first one restricts x to 
an interval 2q =%u- Theorem [21 furnishes a Lagrangian manifold Wq as a graph over 2q x T", containing an 



simply implies that if one writes 
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Figure 1: Splitting scenario 



invariant torus Tq near x — 0, for which it is the unstable manifold. Wq is described by a generating function 
So. Theorem |21 also results in the stable sputnik manifold Wq of the torus Tq, described by a generating 
function S' , such that one- forms dSo and dS' Q belong to the same cohomology class £ = [dSo]. 

Then x is restricted to an interval 21 = X s and one gets the Lagrangian manifold W— as a graph over 
2_ x T n containing an invariant torus 71 near x — —2tt, described by the generating function SI. By (|2.22|1 
S_ = S' Q o l 27r o 1-2tt8- Moreover starting from the pair (So, S_) (corresponding to j = 0, —1) using (|2.22|1 one 
can define pairs of manifolds {Wj,Wj) for all j, containing invariant tori 7} (Wj or Wj being respectively 
unstable or stable manifolds for even or odd values of j respectively) as graphs over l2nj(%o) x T™, with 
generating functions Sj = Sj_2 '-47r°'4-n-e, all characterized by the same £. In particular, S+ = S-ol_ iv ol i7r g 
corresponds to j = 1. 

All the manifolds Wj for even or odd j are identified respectively with W u or W s via the transformation 
S^ 1 , it suffices to introduce the splitting function in the same way as it was described above, identifying 
f3 = ± with j = ±1 respectively. With the same notation one has 6/3 = So — Sp, well defined on lp x T". 

This completes the construction of the normal form, and calls for a structural stability theory for Hamiltonians 
like Hg , which underlies the proof of Theorem^ Another goal is to make this theory amenable to the presence 
of the sputnik symmetry, in order to be able to conclude that £ = [dSj], Vj. Both issues are studied at length 
in the next section. 

3 KAM theory on semi-infinite bi-cylinders over tori 

As the forthcoming theory is self-contained, the notation in this section may be occasionally different from the 
preceding sections. E.g. the function ip(x) is introduced axiomatically, rather than by l|2.11|l . For structural 
stability no symmetry properties of ip are required, which on the other hand are essential for the splitting 
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problem. The details of the set-up arising in connection with the 27r-antiperiodicity of tp are not addressed 
until Section 0] 



3.1 Energy-time coordinates 
Time- map 

Consider a real-analytic function il>(x) for x in a closed real interval I, for definity containing the points ±7r 
in the interior. Suppose ip(0) = 0, ip x {Q) — 1, ip(x) ^ on I \ {0} and is uniformly bounded with its first 
two derivatives. Then ip(x) allows a holomorphic extension into some closed rectangular domain V C C, 
symmetric with respect to the real axis and containing X together with a ball B r at the origin for some r, 
such that ip and its first two derivatives are bounded inside T> and apart from that 1^(^)1 > r /2 in 2? \ M r . 
The pair (tp,T>) is regarded as fixed. 

For x € T> \ {0} consider a map s from T> into a Ricmannian surface S of the logarithmic type and its 
inverse x as follows: 

as well as 

X-S^V, X = ^ox. (3.2) 

The maps x, x can be represented by homonymous functions of a complex variable s £ C, which are 2iir- 
periodic and well defined in a family of semi-infinite strips about the rays 5is € (— oo, T], 3s = irj, j € Z for 
some T, see Fig. 2. They are real- valued on the above rays and vanish exponentially as 5fts — > — oo. Without 
loss of generality T > 1. 

Let us further consider only such values of the parameters r, T that < < r < 2r^ < 1, 1 < <T < 
2T^, for some fixed pair (r.0, T^) defined in terms of (ip, T>) and such that B2 rv , C T> and T^, > — 2 logr^,. The 
functions x(s), x( s ) are holomorphic in the half-plane 5Rs < —2T^. Further estimates will ignore constants 
depending on the pair (ip,T>) as well as constants rt, r in the Diophantinc condition (|2.3|1 by using the 
and >; symbols in an obvious way. 

For p > and s € C let 

A T p = {5Rs < T, |9fs| < p} U {5Rs < -21^} U {5Rs < T, |3s - tt| < p}, 

(3.3) 

a t i(9 = - a t, p , A T , P = A TiP n A T p 

be further referred to as complex bi-strips. Their projections on the union of the real axis and the line 3s = tt 
will be denoted by omission of the index p. The index T may also be omitted in qualitative argument. On 
the other hand, Aoo stands for a pair of lines RUl + i (the + or — sign henceforth having a priority over 
U, n). The difference between the case of a finite T and T = oo will be emphasized. Also define a bounded 
one-strip (rectangle) 

n TiP = {s e C : \Us\ < T, |3s| < p}, (3.4) 

with the same index drop rules. Clearly At, p = ^T,p U IFr.p + i7r. 

For any real-analytic function u on T>, the composition u = u o s returns real values for 3s = 0, 7r, let's 
coin the term "bi-real-analytic" for that. With the above notations for the domains, the functions x(s),x{s) 
will be referred to as bi-real-analytic for s 6 At, p , for some {T,p). For a function ip, given by (|2.11|l one will 
naturally have p < J in Section 4. 

Suppose s e A and h is a canonically conjugate momentum to x. Consider a canonical transformation S s 
from T*A into T*X as follows 

x = x(s). 

y = It (3 - 5) 

where the maps a;, x have been defined by (|3.1I I3.2|l . 
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Let us extend the maps s,x to maps s, x between A x T" and X x T" acting as the identity on ip £ T n , 
in accordance with the general convention of using bold symbols referring to the whole base space. Extend 
accordingly the transformation S s to H s , incorporating the pair (/,</?) £ T*T n . L e t 

C T = A T x T™, C T = A T x T n , C T = C T nC T (3.6) 

be referred to as bi-cylinders over tori, further just "bi-cylinders" . In particular, Coo = Aqo xP. In this 
section only semi-infinite bi-cylinders Ct will be dealt with. Bi-infinite and bounded bi-cylinders Cqo and 
Ct will come into play in Section 0] In qualitative argument, the index T, if finite (unlike T = oo) is often 
omitted further. 

Analyticity domains 

Let us describe more precisely the analyticity domains for the map s in order to further define the necessary 
function spaces on them. Technical difficulties will arise from the fact that the bi-cylinders Ct, Coo are not 
compact. E.g. a "near-identity" transformation a of Alexandroff compactification of Ct should not necessarily 
preserve {s — — oo}, i.e. the differential da may be unbounded. Similarly a Hamiltonian of general position 
on T*C may be unbounded as s — > — oo, unless the momentum h — 0. The reason is clearly because ds(0) 
does not exist. In a series of papers [3 dllSl, etc. these difficulties were overcome via improper integration 
techniques. 

Analyticity domains of functions involved will be characterized by positive parameter vectors p as follows. 
Let p = (r,T,p,a) £ R\ + . Introduce partial order p' = (r',T',p\a') < p if r' < r, V < T,p' < p, a' < a. 
If p' < p and |p — p'| = inf (r — r' ,T — T' , p — p' , a — a') > 0, write p' < p. Addition of parameter vectors, as 
well as multiplication by positive real numbers is defined component- wise, as well as the difference p — p' for 
p' < p. For A 6 A < |p| the notation p' = p — A means subtracting A component-wise. In the sequel 

the components and dimension of the parameter vectors p may vary; p can incorporate T = oo. 

Given T such that both points x : 5Rs(x) = T (of opposite signs) are in the interior of X, one may want to 
be able to describe the widest complex strip At,p for some p such that the image x(At,p) be contained in T>. 
This can be done as follows, see Fig. 2. 




Figure 2: The map x. The figure illustrates how the quantities p, oi can be determined (|3.7I) relative to the 
pair (ip,T>). The shaded region on the left, including the circle around the origin is the domain 1 r ,T,p <|3.9(l . 

Let the level set 5Rs(x) = T for x £ T> intersect the real axis transversely at a pair of points Pq S (7r,oo) 
and P n £ (—00,0). Let 7o,t be a connected component of the above level set containing the former point 
and 7,r,T - containing the latter point (the two 7's may coincide). The points Pq,P^ are connected to the 
origin by the level curves ?ss(x) = 0,ir respectively. Let Pq £ 70. t or t^t : ^ss(P^) = £. For the level set 
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9s (x) = Cj 2! G 2? , let 7^ be the connected component, whose closure contains the origin. Let 

p+ = sup{C : < C < n, P C G 7<, P-C G 7_ C }, 

p_ = sup{C : < C < 7T, P^+c G 7 ; +c , -Pr— c G 7 ;_ c }, (3.7) 



P = inf(p-,p+). 

In other words, the quantity p simply shows for how long the points Pq^ can be moved along the connected 
components of the level curves Sfts = T, to which they belong, so that the whole segment of the level curve of 
3s(x), connecting them to the origin remains contained in T>. Either p <G (0, n/2] or p = it, which corresponds 
to the case when 3?s = T is a simple closed curve contained in 2?, which together with its interior forms the 
image of the half-plane 5Rs < T in T>. If p < ^ (the only case of interest for the splitting problem) the equality 
|5s(x)| = p or |3s(a;) — tt\ = p can be achieved on four different level curve segments where £ = ±p or 
£ — 7r = ±p in Fig. 2. Marking these curve segments simply as 7*, j = 1, 2, 3, 4, define 

(T2 = inf sup l^xl. (3.8) 

In essence, these are the parameters p, 02 entering the main estimate (|2.19|) of Theorem ^ the quantity 02 
will not reappear until the end of Section 4. If one is willing to go into more detail, one should consider the 
above quantities as four- vectors to account for each 7*. 

For p — (r, T, p) denote 

2p = B r U x(A p , T ). (3.9) 
For complex extensions of the bi-cylinders defined by l|3.(j|) introduce the notations 

Ct, p , ct = A T , P x TC, (3.10) 

as well as a , Coo,p,o-, Ct,p,<t analogous to (|3.3I I3.6|l . 



Function spaces 

Defined below are the necessary spaces of bi-real- analytic functions on the bi-cylinders C as well as their 
maps. This is done simply via the composition of real-analytic functions on or diffeomorphisms of X x T" 
with the bi-real- analytic map x. One needs the following formalism in order to proceed toward an implicit 
function theorem for structural stability of vector fields or Hamiltonians on C or T*C to be further used for 
exponentially small splitting estimates. However, the theorem in question is interesting in its own right as a 
"non-compact" version of KAM theory. 

Let 5B J (2?), j € Z+, be spaces of functions real-analytic and uniformly bounded in T>, whose Taylor series at 
x = starts at order j; j • = will be further omitted. With topology induced by the supremum norm, 5B J (2?) 
are Banach spaces. Any u S (T>) can be represented as u{x) = x 3 v(x), where v S 58(2?), or alternatively 
as u(x) = ip 3 (x)w(x), where w S 05(2?). One can take the supremum of |d| or \w\ for an equivalent norm of 
m, the comparison constants depending on the pair (?/>,2?) only. For p = (r,T,p) define the spaces 58-' (X p ) in 
the same way as Q3 J '(X>). As J p C V, clearly W{V) C «8 J '(lp) C ^'(Ip,) for p' < p. For coherence with the 
forthcoming notation, let us write S5p (I) instead of (T p ) . 

If a function u G 2$p(I) has an extra analytic dependence in ip 6 T™, 27r-periodic in each component of 
(p, one adds an extra component a in the above parameter vector p and writes u € 23p(I x T"), u — u(x, <p). 
Define the set 58p(C) of all holomorphic functions u on Ct. p .<j, such that u — uox for some u S Sp(X x T n ). 
E.g. consider a graph y — D x S(x) of a one-form in the variables (y,x) € T*T. If ^(s) = S^rz^s)], then the 
corresponding graph in the variables (h, s) obtained via (|3.5|l is h — dS(s), so y = x~ 1 (s)dS(s) = 0(1). I.e. 
a (^-independent one-form dS(s) over C vanishes exponentially as s — > —00. 
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Q3p(C) is a closed subspace in the Banach space of all bounded holomorphic functions on Ct,p,<t and thus 
a Banach space itself, with the supremum norm | • | p . Note that if u G Q3 p (C), then the function u[s(x),p] 
allows analytic continuation into the neighborhood B r of x — 0, vanishing at x = to the jth order. This 
can be taken for an independent definition of the spaces <B p (C). Moreover, if u(s,tp) G 58 p (C), a multiplier 
X J (s) can be factored out, i.e. 

u(s,ip) = x J (s)v(s,(p), weQ3 p (C), \v\ p x \u\ p . (3.11) 

Section 4 will deal with bi-real-analytic functions on bi- infinite and bounded bi-cylinders. To this effect, if 
u G %$ r ,T,p,a allows a uniformly bounded analytic continuation as T — > oo, then write u € 2} r .oo.p.o-(C) or 

^ £ ^&r,p,<T (^oo ) ■ 

Besides 53t,p,o-(C) stands for the space of bi-real-analytic functions u(s, p), which are 27r-periodic in each 
component of <p and uniformly bounded in the bounded bi-cylinder Ct, p ,o- Also let %$T,p, <r(fl x T") be the 
space of real-analytic functions u(s, ip), which are 27r-periodic in each component of ip and uniformly bounded 
in the bounded one-cylinder n T , P x T™, defined by Clearly <B T , P , CT (C) = [V&TmQ 1 x t ")] 2 - With the 

supremum-norm | • |p, each of the above spaces is a Banach space. Component-wise supremum norm | • | p or 
the equivalent Euclidean norm || • || p will be used for vector functions. 

For any u G <8 P (C), there exists a unique decomposition 

u{s,<p) = u (<p) +u x {s,ip), where u G 2} CT (T"), u x G »J(C). (3.12) 
Using it, define the average (u) "at infinity" as 

(u) = f / u (p)dtp. (3.13) 

For it G 23p(C), its component u\ satisfies an obvious exponential estimate 6 in C p : 

\ u i( s ,f)\ S e s K| p . 

Let us further describe the maps of the bi-cylinder C induced by real-analytic diffeomorphisms of 1 x T™ after 
a change x = x(s). Given p = (r, T, p,a), a sufficiently small A G R ++ and p' = p — A, let 

a = id + b : q + b(q), q = (x,p) £ V x T>, b = (b, B) G [«B P (X x T n )] n+1 , |b| p < A (3.14) 

be a smooth map of I r ',T',p' x TT^, into 2T r ,T,p x T™, well defined for a small enough constant in the above 
estimate for |b| p . It will always be assumed that A < A^,, where the latter is "small enough" in terms of 
the pair (ijj,T>). The natural norm for b is the C 1 -norm in 23 P '(Z x T n ), which is easy to estimate knowing 
the C°-norm on some intermediate space 55 P "(Z x T") with p' < p" < p. Details regarding intermediate 
parameter values will be mostly bypassed. 

Let S) Pi a(I x T n ) be the set of all such diffeomorphisms and define S) p .a(C) as the set of all maps 

a : C p , -^C p , ]oe3 PiA (IxT") : a = soaox. (3.15) 

The analyticity indices can be dropped in the qualitative argument. For a one can come up with a unique 
representation a = a\ o do, where do : x — > x + bo(ip) acts on ip as the identity, while di = id + b preserves 
x = 0, i.e. b = (b,B) G 23 p (IxT") x [03 p (IxT™)]". Then a = a\oaa, where the transformation a\ preserves 
{s = — oo}. Naturally one can write a\ = id + b, where b — (b, B) £ [5B p (C)] n+1 . Indeed for the change 
ip — > p + B(x, ip) all one has to do is to define B(s, p) = B[x(s), ip]. As far as the variable s is concerned, the 

6 Clearly not any real-analytic function of s vanishing at infinity at an exponential rate will be a member of one of the above 
spaces. E.g. for u(s) = se s , the function u = n[s(x)] is not analytic at x = 0. 
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change of the x- variable corresponding to a\ can be written as x — > x + b(x, tp) for b £ 2$ p (Z x T"), i.e. one 
can write b = tp(x)v(x, <p), with v € 2} p (Z x T n ). Thus given x = x(s) one gets 

for some 6 € Q3 P (C), with the norm |o| p X |u| p < |fo| p . In particular the change of s under a\ is asymptotically 
an identity as s — ► —00. The above expression can be viewed as a C 1 functional from x T") into 58 P (C), 

mapping zero into zero and whose differential is bounded away from zero in some neighborhood of zero. 
Then by the inverse function theorem any function b S 23 P (C) with |6| p < A^, generates a diffeomorphism 
x -> a;[s + &(s,p)] = x + 6(a;,<p), with oe «8j(X x T n ). 
For the transformation ao it's easy to see that writing 



a>o(s,<p) = a [s,b (ip)} 



is as far as one can get, as the series expansion analogous to the preceding formula will not converge uniformly 
in s for 5is < — 2T^, i.e near x — 0. 

Remark 3.1: Unless bo = 0, the quantity ao[s, bo(<p)] is neither real-valued, nor continuous for real s < —2T^. 
Continuity can be achieved by extending it to {s = —00}, then the defining component ao or ao maps 
{9?s < -27^, 3s = 0, tt} U {s = -00} into {3?s < — T^, 3s = 0, tt} U {s = -00} and the differential 
dao = ds o ddo o da? is unbounded. Further calculations will use the expressions 



da 1 



i/)[s(x)+ba(<p)] dbp(ip) 

id n 



^[a^J+fioH] =x(«)+[l+X(*)»7i(a)]6o(v)+»j2[* J 6o(v)]^(v), (3-16) 



where the quantities 771, 772 viewed as functions of (s, y>) are in 25 P (C) by the assumptions on tp. 

As one is interested in the coordinate changes a G 3D p ,a(C) only as far as their action on functions from 
Q3 P (C) is concerned, they are naturally represented by an element b = (bo,b) of Q3o-(T n ) x p3 p (C)] n+1 = 

55p 0,1 ^(C), with the product topology and vector supremum norm | • | p , the origin corresponding to the 
identity transformation. 

Then if u € 25 P (C) and |6| p < A, u' = uoa(b) e *B P '(C) with p' = p — A. Moreover with p" = p - ~a one 
can write 

|tt-«oa|„/ < \du\ p „\b\ p , < -\u\p\b\p,, (3.17) 

by the Cauchy inequality. 

Apart from a = a\ o a 0j the general form for the transformation a can be also taken as 

„(6>=a (M + 6:{; : °™ t (3.18) 



In order to deal with functions of s, which are unbounded at infinity, let us introduce a function space 
«8p (C) ^ <B CT (T") x <8 P (C) as follows, see IpTTo)) : 



u(s,<p)e<B-(C) iff U (s >v ) = m^, z,(s,^)e«B p (C). 



The norm on Q5 p (C) is simply |u| p . Since one can write in the spirit of 1|3.12[1 v(s, ip) = Vo(<p) + x( s ) v i( s : p)> 
with vi G *B P (C), then 
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and sup(|vo|<r, \ v i\p) can be taken for the norm \u\ p as well. Also let 5Bp ' X \C) = Q5p (C) x [5B P (C)]™. An 
element of this space describes a bi-real-analytic vector field on the bi-cylinder C. Clearly *8i~'^(C) = 
^Bp 0,1 -'^), the elements of the latter space representing the maps of C. If g £ 25 p (C) is a vector field and 
a € X> p .a(C) then goa is not in Q5p, '^(C), however. Indeed, as the result of the transformation o,q the 

quantity v ° < i tp ) in the first component changes to — ; — , -^A^j — _ which can blow up for a finite s. However 
3 W ^[x(s)+6oM] 

da~ 1 g o a corresponding to the "new" vector field does belong to 5Bp, '^(C), see (|3.16[1 . Also, a simple 

calculation shows that in order to estimate the norm for partial derivatives of a function u £ 25p~(C) one can 

still use the Cauchy formula \du\p> < A _1 |w| p . 

As far as Hamiltonian functions on T*C are concerned, consider the Banach space 25 KiP [T*(2" x T n )] (with 
the sup- norm) of bounded real- analytic Hamiltonian functions on T*(I x T"), given by Taylor series with 
coefficients in 2$ P (Z x T"), uniformly convergent for the momenta p = (y,I) inside B" +1 , k > 1. Define the 
space Q3 KjP (T*C) of Hamiltonians on T*C as the subset of holomorphic functions on T*C, such that 

H £ »«,p(T*C) iff 3H e S £ , f [f(lxT")] : H = HoE s . 

Thus the members of 25 KiP (T*C) are given by power series in (^y,/), with coefficients in Q3 P (C). 

The final remarks on the notation are that sometimes, if it is clear to which of the above spaces a function 
u belongs, the norm of u may be referred to simply as |u| rather than |u| p . If u S 55 P (C) and has bounded 
partial derivatives, the notation |u|i p will stand for the C^^-norm. The notation \u\oo wu l stand for the 
supremum norm of u, restricted to the real values of all its variables, except s which is either real or = it. 



Conjugacy problem 

In the formal framework developed above one can set up a conjugacy problem for a class of bi-real-analytic 
perturbations of a constant vector field 

x a = \— + (Lo,—) (3.19) 
os dip 

on the semi- infinite bi-cylinder C, with A e R++ and a Diophantine u> £ 22J? 7 - In the same way as (|2.2Uf) . 
the unperturbed vector field can be taken slightly more general, i.e. 

x e = A— + (w + 9 {s , — , 
as dip 

with an angle- independent n- vector function 9(s), whose each component is a member of the space 25 J (C). 
This case is reducible to (|3.19f) after a change 



s = s', tp = tp' + \- 1 8{s + t)dt. 

J — oo 

The question of structural stability of the vector field xq under the group D(C) of bi-real-analytic maps 
of C is roughly as follows: given a vector field x — Xq + g, where g € 25p~(C) (with the parameter vector p = 
(r,T, p, a)) and \g\ p is small enough, does there exist a coordinate change a s ®p,a(C), i.e. a(b) = ao(&o) + b 
with a = (a , id) and b — (bo, b) £ Q5p?' 1 ' ) (C), p' = p — A, such that 

da~ 1 xoa = Xq? (3.20) 

The general answer to this question is no, as it is for the torus, for one can take x = Xq + with a constant 
£ £ R n+1 (aso being further identified with a constant vector (A, ui) £ R" +1 ). Note that within the map 
class 2) Pj a(C), the answer is no even if only the "longitudinal" component of the vector £ is nonzero, as the 
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scalings of the variable s are outside this class. Hence, conjugacy should be sought modulo £ G M n+1 , asking 
for a pair (a(b),£), such that 

da^ 1 (xq + g + o a = Xq. (3-21) 

The problem can be relatively easily shown to satisfy the input of an implicit function theorem of Nash-Moser 
type, following the papers of Zehnder who made further generalizations in the abstract set-up to 

embrace the KAM theory with its small divisors. One essential modification is that here one should deal 
with the differential operator 

D x ^ = XD S + (u, D v ) = XD S + D u , (3.22) 

rather than just D U) defined by (|2.5() . Auxiliary results apropos of solvability of linear PDEs involving the 
operator D\ tUI in the set-up of various function spaces introduced earlier are presented in Appendix A. 

However, estimates resulting from an application of the abstract theorem are unsatisfactory for the analysis 
of the normal form near a simple resonance (|2.6|) with its hierarchy of orders of magnitude and parameter 
dependencies. With extra scruple one can benefit by quasi-linearity of underlying equations, intermittent use 
of C 1 and C° estimates, similarly to the classical KAM case (33 . This is done in Section l3~2l resulting in 
particular in Corollary 12. 21 



Application to Hg 

Let the function ip be given by (|2.11|) . Then the domain D can be taken as a closed rectangle of some 
semi- width, bounded from above by ai, symmetric with respect to the interval Xq — [— 27r + r, 2-7T — r] of the 
real axis for some r < inf(cri, 1). The pair (ip, V) as well as the parameter bounds r^, T^, are well defined, 
in particular one can ensure T^, > 1, because the mean value of ijj over T' is zero. Then given T £ iT^,2T^) 
one can use (|3.7I3.9[) see Fig. 2, to determine the constants p and 02 as well as the domain X v . 

The application of the transformation S s to the Hamiltonian l|2.21|l . whose a;- variable is restricted on I p 
with the notation (p; q) = (h, I; s, tp) results in the "new" Hamiltonian H s = Hg o S s 6 Q3 KiP (T*C) as follows: 

H s (p,q) = Xh + (wi,J) + {Q 2 p,P) + fe[x(q)] + (g e (p,x(q)),p) 

(3.23) 

= H X x(p,q) + V t (p,q), 

where p = = (y,I), and the function x( s ) is defined by (|2.11l l3~2^l . Or, including the momentum- 

dependent part of g g into the "unperturbed" Hamiltonian: 

H s (p,q) = Xh+(u 1> I) + 2 (p;q) + [f(q) + (g(q),p)] 

(3.24) 

ee H x (p,q) + V(p,q), 

where f(s,ip) = f e (x(s),ip), g(s, <p) = diag(x _1 ( s )> id n) X ( S ),<P)- Namely g e *B^ 4) (C) and the differ- 
ence between V and V t is that the latter may contain super-linear terms in p. For the sputnik Hamiltonian 

H' s = H' e oS 8 G 5B K ,p(T*C) one gets 

H' s = H s o i^ 2 , with L x = L^f, o 5 S , L x : (h, I, s, tp) -> (h + Ax 2 (s), /, s, tp). (3.25) 
Then similarly to the two previous formulas 

H' a (p,q) = H_ Kt (p,q) + V{(p,q) 

= -Xh+ + 0' 2 (p; q) + [f(q) + (g'(q),p)}, 



(3.26) 



i.e H_\ t is the same as i?A,t, but for the sign of the first term and V£ = V t o L x 2 , whereupon the terms 
/' G 2J P (C) and g' G V&p~' 1 \c) correspond to the zero and first order terms of the Taylor expansion in p; the 
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rest of the expansion is absorbed by the term 0' 2 (p; q). Just the same, V will denote the expression in the 
square brackets in l|3.26|) above. 

By Lemma 12.11 one can take the radius k of convergence of both Taylor series in p as large as necessary, 
for instance k > 2 and for all p £ B K assume that uniformly over the (complexified) base space, the absolute 
value of each eigenvalue of DppH s and DppH' s is uniformly bounded from below by 1/2 and from above by 
a constant times MqR^ , 

3.2 Hyperbolic KAM theorem 

This section develops a KAM- type approach to Hamiltonian systems in T*(C), such as the transformed simple 
resonance Hamiltonian l|3.24|) . The section contains the statement of Theorem [2] the principal part of its 
proof and a number of corollaries, one of each is the solution to the conjugacy problem (|3.21|) . Yet the 
prototype of Theorem [21 can be found in [31] the theory exposed below is quite different in flavor, as it avoids 
compactification of the base space and elucidates the connection with other methods of study of manifolds 
asymptotic to invariant tori, developed for instance in [7J, |13|. [37], The theorem contains significant 
technical improvements, allowing for a supposedly optimal parameter dependence, necessary in order to make 
the theory applicable to a generic simple resonance normal form and for the purpose of studying the lower 
bounds for exponentially small splitting ^U|, |33| . 

Theorem [2] per se represents an alternative to the traditional approach to whiskered tori, largely due to 
Graff 16 . The present approach appears to be more natural for describing the whiskers as semi-infinite 
cylinders over tori globally 7 , as it yields their representation via generating functions, solving the Hamilton- 
Jacobi equation in the setting of the Banach space families, introduced earlier. This representation with an 
extra sputnik property is ideally suited for describing the splitting problem in Section 4. In addition, it can 
serve as an initial set-up for the variational construction of orbits, shadowing the whiskers' intersections [S]. 

Theorem [2 applied to the Hamiltonian 13.24[) implies structural stability of its principal part, when the 
momentum-subquadratic part of the perturbation (f,g) is small. In case it is zero, the zero section of 
the bundle T*C is an invariant Lagrangian manifold. Under the perturbation, this manifold persists as a 
graph of a closed one- form dS over C. Throughout the proof, a rapidly vanishing sequence of generating 
functions {«S r j}jeZ+ is constructed, starting from So = 0, such that the graph of the one-form d^2^ =Q Sj 
approximates the Lagrangian manifold in question with increasing accuracy as N — ► oo. The manifold itself 
is then described by the generating function S = J2JLo ^ n order to obtain the sequence {Sj}, a KAM 
type iterative procedure is developed. A single step of it is described by the Iterative lemma. Namely, for 
j G one constructs an "affine" canonical transformation on T*(C), i.e. 

with (p ,q ) = (p,q). 

A closed one-form dSj is given by the generating function Sj(q) = Sj(s,tp) = (£j,ip) + Sj(s,(p), where 
£j G R™ is the H^-iCjM.) cohomology class representative for dSj, and the function Sj(s,ip) is a zero-form 
on C, i.e. is 27r-periodic in each component of ip. A single pair (a,j,Sj) € ^p j _ 1 ,A j (C) x Q5p, (C), where 
p' < pji < pj < p = p = (r, T, p, a), the initial set of analyticity parameters, Aj = \pj-i — pj\ for 1 < j < j' , 
and linij^oo pj = p', component- wise. The function Sj is chosen in order to eliminate to the leading order a 
momentum-independent term / in the perturbation. The transformation Oj is an approximate (first order) 
solution the conjugacy problem l|3.21fl . Its existence is ensured by the specific choice of the quantity £j. Upon 

7 Graff's theorems apply to partially hyperbolic tori of all dimensions < n, but are in essence local near a hyperbolic 
equilibrium, where the whiskers, being very "short" cylinders over tori, can be naturally described by naive generating functions. 
The present theory takes advantage of the fact that in the simple "one-hyperbolic" case discussed, the truncated normal form 
Hamiltonian 12.71 is integrable and the generating function St 12.131 is defined globally over a "long" cylinder; this necessitates 
a considerably different analysis. The present approach seems to be extendable to the case of lower-dimensional tori, despite in 
the latter case one certainly cannot hope to have a simplistic global description for the whiskers. Nevertheless, one may try to 
consider only narrow strips thereof in tubular neighborhoods of transverse homoclinic orbits, generically existing in the phase 
space of the non-integrable "hyperbolic" sub-system, corresponding to the truncated normal form if one lets dimx > 1 in 12.71 . 



21 



application of the transformation tyj, the principal part of the Hamiltonian (modulo a constant) picks up a 
term equal to a small constant times h, thus slightly changing the value of A. 

As j — > co, the Hamiltonian vector field in question, restricted to the manifold p = dS thus becomes 

conjugate to l|3.19[) (with a slightly changed value of A) via a canonical transformation = Si o H2 o 

Note that the classical Kolmogorov's theorem ^7j allows a similar geometric interpretation 0J| with the base 
space, of course being a torus, rather than a bi-cylindcr. 

Set-up and statements of Theorem 12*1 and Iterative lemma 

The unperturbed Hamiltonian in (|3.24|) belongs to a certain class. Fix a Diophantine u> £ 2U™ 7 and define a 
class 91 w as follows. 

Definition 2 (Unperturbed Hamiltonian) A function H\ on T*C belongs to the class 0T W if modulo a 
constant, it can be represented as follows: 

H x (h,I,8,tp) = \h+(u,I)+0 2 (p;q), (3.28) 

where A 7^ (further assumed positive ) and 

1. 02(p,q) — C>2(^jj,I;q) 6 Q3 K , P (T*C), for some parameter vector (n,p); 

2. 3R,M € K++, such that V(p,q) € B^ +1 x C„ || (D 2 n 2 (p; q))" 1 1| < R' 1 and ||D$pO a (p; q)\\ < M. 

Given H\ G 9t w , consider its small perturbation 

H = H X + V, V(p,q)=f(q) + (g(q),p), (3.29) 

where 

/ S «8 P (C), g S B$ _,1) (C), |/| p < n, | ff |p < ^"S for some < M < !/ < 1. (3.30) 

The parameter v is further used to obtain the desired smallness condition l|3.33[) generally indicating that 
the above described iterative procedure allows larger upper bounds for the norm of g than the norm of / in 
the perturbation V . 

Theorem 2 (Hyperbolic KAM theorem) // [i is small enough, there exists a canonical transformation 

* = *(a,S):( 9 = , jo (3-31) 

y 1 J \ p = '■(da) p' + dS, y ' 

such that for any k' < n, p' < p and some new parameter values A', R' , M' , different from A, R, M respectively 
by O(i-i), one has H o ^ e 9T W , with the new parameter set {n' ,p' , A' , R' , M'} and A = |p — p'\: 

1. The transformation a — a(b) € 2D p .a(C) ; with |&|i,p' = 0(jx). 

2. The one-form dS is defined by the generating function S(q) = (£, ip) + S(s. ip), with £ G M™, S S 2$ P ' (C), 
and \Z\,\S\ hp , = 0(ji). 

In the above non-technical formulation of Theorem |3 small enough" means that it satisfies the following 
smallness condition (|3.33|l . The symbols O(n) depend on the parameter values from both the old and the 
new parameter sets. Further without loss of generality, one can assume that the quantities S = a — a' , A = 
|p — p'|, A, R, M , M -1 < 1. The exact estimates are summarized below. In applications, one or more of 
them can turn out to be functions of a small parameter e, and the magnitudes of the analyticity loss in the 
variables (s, ip) can differ considerably. Then the following estimates can be adjusted if necessary, see e.g. 
footnote ^2 below. 
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Parameter statement of Theorem [5] 

Let 

k' = k-1, ? = inf(7<5 T ,A), 77 = Rmf [M~ 1 <;a, v). (3.32) 
There exists a constant C, depending only on n,r,ip,V, such that if 

fi < C~V < (i?/Af) 2 A 2 [inf(cr,t/)] 2 , (3.33) 

the following estimates hold: 

\S\ P > < CimT 1 , \dS\ 9 , < Cfi^A)- 1 , \dS\ p > < Cfir)- 1 , 

\b\p> < Cfi^)- 1 , \db\ p > < Cute*)- 1 , 

(3.34) 

A _1 |A'-A| < Cfi{nX)-\ 

M-^M' - M\, R-^R' - R\ < CftfaA)- 1 . 

Theorem[2]can be cast into the abstract generalized Newton method framework However, in order 

to obtain the desired parameter dependencies, a direct proof is given. The main tool is furnished by the 
following lemma, fulfilling a single Newton's iteration. 

Lemma 3.1 (Iterative lemma) For a Hamiltonian l3.28H3.3Up with a parameter set {k, p, A, R, M, fi, v\, 

if ji is small enough (condition \3.3b]) below), there exists a canonical transformation 

* = *(a,5'):( 9 = Z^'l-i / 1 ct (3.35) 
v ' ; \p = t (da) p' + dS, y 1 

such that for any n 1 < k, p' < p, and some new parameter values A', R' , M' , different from A, R, M respectively 
by O(fi), one has for A = |p — p'|: 

1. The transformation a = a(b) G £> Pi a(C) *s such that \b\\y = 0(fij. 

2. The one-form dS is defined by the generating function S(q) = (£, ip)+S(s, ip), with £ € R™, S € 58 p /(C), 
and |£| liP , = 0(m). 

3. The Hamiltonian flof = H\> + V , with H\> G Vt^ satisfies Definition^ and k3. 3(J\) with a parameter 
set {«/, p', A', R' , M', //, u'}, where // = 0(/i 2 ) and v' is independent of \i. 

Clearly, the quantities a, S, etc. in the Iterative lemma are not the same as their homonyms in Theorem[21 
it should not cause confusion. In the non-technical formulation above, the quantities fi', v' and the symbols 
O(fi) depend on the parameter values from both the old and the new parameter sets. They are further 
specified as follows. 

Parameter statement of Iterative lemma 

Let k — k! < 1 and s,rj be computed via formulae \3.32ty . There exists a constant C, depending only on 
n,T,ip,T>, such that if 

fi < C~Va(k-k'), (3.36) 
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the following relations hold: 

\s\ p . < c^-\ \ds\ v , < c^y\ lei < c m -\ 



\K' < c^,)- 1 , \&\ p , < cm^a)- 1 , 

A _1 |A'-A| < Cn(T]Xy\ (3.37) 

M~ l \M'-M\, R- l \E!-R\ < Cfi^A)- 1 , 

v 1 = mi(M-\^v), fi' < C 2 n 2 i]- 2 . 



The proof of the Iterative lemma is given in the next section. Detail of the proof is important in order to 
justify the following remarks, concerning Theorem[21 Once one accepts the Iterative lemma, the rest of the 
proof of Theorem |21 becomes a routine iteration scheme, given in Appendix B. 



Corollaries and remarks 

Theorem |2 essentially states that the zero section p' = is an invariant Lagrangian manifold for the Hamil- 
tonian H o <]> e < yi ul . Due to the "affineness" of the transformation ^(a,S), in the "old" coordinates (p, g) 
this manifold is a section of T*C, given by 

W(s, <p) = {(h, I, s, ip) G R"+! x C : h = Dj^s, <p),I = £ + D v S(s, p)}, (3.38) 

where S G 58 P '(C), i.e. S(s,ip) = § (ip) + §i(s,ip) in the sense of (|3~T2|) . Recall that C = A U T" is a 
bi-cylinder, i.e. 3s = or n. 

The manifold W(s, (p) has been compactified by incorporating an invariant torus T corresponding to 
{s' = — oo}. The flow on T is conjugate to rotation with the frequency u>. With S' — S o a and q' — (s' , ip'), 
another parameterization for W is 

W(q') = {(p, g) : g = a(g'), p = W^Of 1 d5'(g'), Q' e C}, (3.39) 

and get T as 

T = p| v') G A T x T™]. (3.40) 

T<0 



Remark 3.2: In the sequel T will be often described as located "near" s = — oo: this verbiage refers precisely 
to the representations I|3.38I3.40|) and the underlying estimates. 

The representations (|3.38|l constitute the basis for the splitting analysis in Section^ For easier cross-reference 
let us recap the above as a corollary. 

Corollary 2.1 (Hamilton-Jacobi equation) Let Hq(0, q) — 0. The function S (q) = (£,>p)+S(q) satisfies 
the Hamilton-Jacobi equation on Q3 P '(C) : 

H{d q S{q),q) = c , (3.41) 

with a real cq bounded by \S.51)) . 

Let us now look back at the conjugacy problem Ij3.21|l . It corresponds precisely to the case / = in the 
perturbation (|3.29|) . Then the parameters M, R, v are redundant (can be all set to 1 in the estimates) and the 
unperturbed Hamiltonian H\ can be thought momentum-linear. Then given the perturbation g £ Q3p :1 ^(C), 
the aim of the conjugacy problem is to find a transformation ^f(a) — ^>(a, 0) to conform with l|3.31|l that is 
with S = 0, as well as a constant £ G M n+1 (which is not unrelated to the vector (A — A',£) G R n+1 ) such 
that {H\ + (g + £,p)) o = Hy . Then the proof of Lemma f3. II and Theorem can be straightforwardly 
adjusted to yield the following corollary. 
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Corollary 2.2 There exists a constant C(n,T,Tp,U), such that i/g 6 <8 p (C), with 

\o\p < c-\a, 

there exists a pair (a, £) E (33 p .a x R n+1 ), such that the map a = a(b) effects \3.21\) and 

|£|,|b|i, P < < C\ 9 \ p (Xa)-\ (3.42) 

Besides, if H = H o E s , then the transformation S s o <]/ o Sj 1 effects the structural stability of the normal 
form 

H\(y,I,x,<p) = Xi)(x)y+ + 2 (p;x,ip) 

under small perturbations, proving the existence of an invariant manifold which is a graph over the variables 
(x, tp) E [— 2tt + r, 2ir — r] xT", containing an invariant torus near x = |34| . 

The next corollary is quite obvious with respect to H. It claims that if the latter is perturbed by a 
pair (f,g) such that f,df,g all vanish at x = 0, then the invariant torus at x = satisfies the Hamilton 
equations and does not move: in particular the constants £, Co in Theorem|21and Corollarv l2 . II should be zero. 
For instance, this is the case in Arnold's example £Q (the Hamilton-Jacobi formalism for such a degenerate 
perturbation was developed in [SZj). This fact can also be established by going through the proof of Lemma 
13. II and will be used further to claim Corollarv l2.4l essential for the splitting problem. 

Corollary 2.3 (Degenerate perturbation) Suppose f G 23^ (C) and g = (g,G) G Q5 p (C)x [<Bj(C)] Tl . Then 
apropos of the transformation fOT)) one has £ = 0, S G ©p/(C), b = 0, b = (b, B) G *B P (C) x [<8j,(C)] n and 
Co = in Corollaru \2.1\ 

The following remarks address yet more technical issues. 

Remark 3.3: (Estimates) The smallness condition l|3.33|l for /i appears to be optimal as far as the parameter 
dependencies are concerned: if A, v = 1, A = S it reproduces the standard KAM theorem optimal smallness 
condition, see e.g. [29| . The use of an extra parameter v has been essential here to express that the order of 
magnitude of g is inherently somewhat greater than that of /, as far as the perturbation is concerned. This 
fact can obstruct accessibility of the condition (|3.33|) if one pursues Kolmogorov's approach to KAM theory, 
see e.g. Similar (standard KAM) estimates resulting from the general abstract implicit function theorem 
machinery are also worse UJJ, Under the assumptions of Corollary 12.31 the estimates of the Parameter 
statement of Theorem I A . 1 1 shall be modified as follows. Apart from £ = 0, one should use <r = A and formally 
set R = 1 in all the estimates. Finally, if A < in Definition then obviously |A| should substitute A in the 
estimates. 

Remark 3.4: (Local 8 uniqueness and parameter dependence) Given H\ and a small perturbation (/, g) 
obeying the smallness condition (|3.33|l . the pair (a, dS) is unique (as it is in standard analytic KAM theorem). 
Indeed, the unique solution u(v) of the PDE in Proposition IA. ll provides the right inverse of the operator 
D\^, which is also its left inverse, guaranteeing uniqueness, see |4()| . In other words, local uniqueness 
follows from the uniqueness of PDE solutions (modulo a constant) in Appendix A. Similarly, in the case of 
a continuous (e.g. on the pair (f,g)) or real-analytic dependence of H in an extra parameter (e.g. fi), the 
pair (a, S) retains the same type of dependence in the parameter. Local uniqueness is indispensable for the 
splitting problem to be well-posed. 

Remark 3.5: (Other settings) It is known in KAM theory that the non-degeneracy assumption in Def- 
inition allows many variations |32| . Theorem [5] can be adapted to these settings in the same way as the 
standard KAM theorem. For instance H\ in H3.29(l can be only linear in the actions J, provided that the per- 
turbation V does not depend on / either, the so-called "isochronous" case. In this case the transformation 

8 This is certainly not true for "large" perturbations: each hyperbolic manifold (unstable or stable) should have a counterpart 
(stable or unstable) or sputnik, see the coming Corollary 12. 41 
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a acts on the ip- variables as the identity, and in the smallness condition (|3.33|) one can certainly set R = 1. 
One can pass smoothly (so-called "twistless" case Ql]) to the isochronous case from the set-up of Definition 
[21 by introducing an extra parameter, which would multiply all the terms containing / in the Hamiltonian, 
except (uj, I). This can be verified by examining the proof of Lemma |3. II in the same fashion as it is shown 
for the standard KAM theorem |2U] . 

As this paper has been motivated by the need for a general theory for exponentially small splitting near 
resonances, and the main estimate Q4.13H of the ensuing Theorem [3] will not persist in the C r category (even 
to the first order of perturbation theory) the latter setting has not been considered. However there seems 
to be no obstruction to generalizing Theorem |2 for C r Hamiltonians (with r large enough) using standard 
smoothing techniques 0U1 E3 ■ 

Suppose the Hamiltonian H\ has extra structure admitting an unperturbed sputnik, similarly to 13.23I3.26|) . 



Assumption 2 (Sputnik on Ct) There exists a canonical transformation 

U = L*(a*,S*) : { q = fj q :}-i j cy* i \ (3-43) 
V ' I P = '(da*) p,+dS*(q), y ' 

where a* is a map ofCx, a* — id + 6*, with b* G [QS p (C)] n+ , while dS* is exact, with S* G 2Jp(C), such that 
both Hamiltonians H and H' = H o L* satisfy the input of Theorem^ with the same analyticity parameters 
K,p and equivalent 9 non- degeneracy parameters A, R, M and smallness parameters y~,v. Moreover, see \S. 2& - 
\3.3U\) . suppose H\ o L. t = Hy + V, where H\,H\i G 9T W7 |A'| x |A|, while the perturbative term V' satisfies 
the assumptions of Corollarv \2.Si 



Remark 3.6: Regarding the normal form Hamiltonian (|2.6|l alias I|3.23I3.26|) , the sputnik transformation 
L„ = L~ 2 arises from the fact that the lower separatrix is a graph of over the upper one for the truncated 
normal form Hamiltonian (|2.7|l . To this effect, the action of i* is H\ t oL, = H-\ t , natural for symplectic 
flows. So in this case a* — id. However further in Section 4, the diffeomorphism a* will incorporate a shift 
of the variable <p by 2f3ir8, (3 G {+,— } (and of s by —f3iir), see (|2.22ll and 14.4fl to come. Assumption [3 is a 
generalization, which may be useful for instance in case of higher multiplicity resonances when there is more 
than one characteristic exponent A, see also footnoted 

Note that as a* acts as the identity on the set {s = — oo} (possibly causing reparameterization of the angles 
(p in terms of ip*) it is not only S* but also S* o a* G Q3p(C), and the unperturbed invariant tori for H and H' 
can be identified. As H' satisfies the input of Theorem [3 there exist a canonical transformation ^'(a 1 , S* 1 ) 
and an invariant manifold W' containing an invariant torus T 1 , versus the transformation ^(a, S) and the 
pair (W,T) for H, described by (j3.38l I3.4()|l . Note that the claim = L^ 1 o v|/ would be false, as the latter 
transformation would still make the manifold W the zero section, rather than W' . 

Corollary 2.4 (Sputnik) Under Assumption^ let ^{a,S) : H o * G Ot^, ^'(a 1 ,^ 1 ) : W o g D1 w 
according to Theorem^ Then the closed one-forms dS and dS 1 belong to the same cohomology class 

£ G -ff^CjK.). Both dS and dS' = dS* + dfS" 1 o (a*) -1 ] belong to the same cohomology class and satisfy the 
Hamilton- J acobi equation j3.41)) of Corollarv \2.1\ for H, on the same energy level cq. The closure W' of the 
graph of the form dS' intersects the manifold W defined by for the form dS at the torus T defined by 

f£TV\> , i.e T =T. 

Proof : Let H^ = H o <]> o L*. Since H o $ G by the explicit form (|3.43|) of i*, under the assumptions 
on the pair (a*, S*), the Hamiltonian H2 = H' o (L^ 1 o$oL t ) satisfies the conditions of Corollary 12.31 for 
pi small enough. 

9 Equivalent here is meant in the same sense as the equivalence for orders of magnitude or norms, that is up to a constant 
factor. Clearly \,R,M in Theorem |5] can be just bounds, rather than the actual values of non-degeneracy parameters. I.e. a 
change A —* —A or M — > 2M is inconsequential. Besides the transformation L* may in principle entail some extra analyticity 
loss of the order A. Necessary amendments are easy to make for a concrete example. 
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Indeed, (ffo$)oL, = (H\ + [c\h + 02(p; q)]) ° 7*, where c\ is a constant and the term in square brackets 
is 0(/i). Then the assumption on 7* (in particular its preservation of {s = — oo}) implies the previously 
made statement, namely 

H 2 (P* , q*) = A2/1* + {u>, I*) + 2 (p, ; q m ) + V 2 (p, , q* ) , 

where A 2 = A' + 0(/i) and p, = (^_, /„). 

In fact, the pair (f2,92)-> comprising the perturbation V2 above, which has arisen as the result of substitu- 
tion of i|3.43|l into the Hamiltonian H 0$ can be bounded in terms of |db| p /A _1 times a constant, determined 
by on 'J'*, see 1)3.341) . As S*{q Sf ) is proportional to x 2 ( s *)> tne quantities j 2 (?*), 92(9*) behave as — > —00 
amenably to Corollary 12. HI 

Then for /i small enough 10 there exists a canonical transformation ^2 — ^2 (0-2, £2) in the form 1)3. 31|) . 
such that H 2 ° ^2 € 9^. The flow of the Hamiltonian H 2 (p :t ,q^) = A 2 /i* + (w, 7*) + ... contains an 
invariant Lagrangian manifold W* = {p„ = dS^q*), q* G C} near the zero section {p^ = 0, q^ € C}, with 
the generating function S2 G 58?, (C), containing an invariant torus T, corresponding to the limit as s* — > —00 

and p^ = 0. The same flow also contains the invariant manifold W = {p* = — dS*[a*(q t )], q.^ € C}, i.e. the 
closure of the zero section for H o <I< (set p = in (|3.43(l ). containing the same torus. Note that both 
[dS*] = [dS 2 ] = 0. 

The claim now is that W* = W , by local uniqueness. Indeed, T^cl/oL^ovE^ = H "o L^o (7" 1 o\& o L*oty 2 ) E 
9t w and the transformation in the parentheses is near identity. On the other hand, (H o L*) o f ' G so 
one must have = L~ l o$oL,o v]/ 2 . for // small enough. Both the left and the right hand side of the latter 
identity must have the (unique) form (|3.31|) in terms of the pair (a 1 , S 1 ), which by Theorem|3is well defined 
as long as /j, satisfies (|3.33|) . Then W and W clearly belong to the same energy level. 

This essentially completes the proof. One may notice that there is a natural semidirect product structure 
that the canonical transformation composition induces on pairs (a,S). Namely in the "old" coordinates 
(p, q) the manifold W 7 * is represented in terms of the generating function S' = S* + S 1 (a*) -1 , on the other 
hand equal S + S* o a -1 o g 2 o (a o a*) -1 , which (as [dS*] = [c?5 2 ] = 0) clearly implies [dS] — [dS'], i.e 
S(s, tp) — (£, tp) + S(s, tp) and S'(s, tp) — (£, tp) + S'(s, tp), with the same £. However, despite S and S' are 
both elements of Q3 p , (C), one cannot claim that necessarily Sa(ip) = S' (ip) in the sense of the decomposition 

arm, . □ 

Remark 3.7: (Notation) The final remark in this section is that the theory developed above can obviously 
be applied to the restriction of the Hamiltonian H(-, s) to "one strips" s G {5fts < T, \$ss\ < p}U{5Rs < — 2T^} 
or s € {3?s < T, \^ss — n\ < p}U{3?s < — 2X l p}. Let us reserve the notation h.p,T,p with /3 = + or — respectively 
for the lower or the upper one of the strips above. The indices (T, p) can be omitted in accordance with 
the notational convention introduced in Section 13.11 also T — 00 can be used. In the same fashion the 
subscript (3 may be added to the notations C, H,^> , S, a, etc. Clearly, as far as the application of Theorem [21 
to the restrictions Hp over Cp is concerned, it results in the same constants Co, £ as well as the s-independent 
components So(ip) of the generating functions Sp, which then represent analytic continuations of one another 
for j3 G {+, — }. These /3-notations will be used in Section 4. 



3.3 Proof of Iterative lemma 

The proof of Lemma 13.11 consists of several steps inherent in KAM-type theorems ^| • I n the classical 
case, Proposition 12 . 3 . ll plavs the key role. Here these are Propositions IA. 1IA.2I The notations 'J, a, S, etc. 
in this section pertain to the formulation of the Iterative lemma and its parameter statement, rather then 
Theorem El 

10 Note from the remark on estimates that the smallness condition for the norm of (/2,92) to guarantee the transformation ^2 
is in fact more relaxed than the right hand side of 13.331 . However, one should not worry about the precise smallness condition 
here: Theoreml^l warrants the existence of the transformations \P' as long as 13.331 is satisfied, and this is all one needs, plus 
local uniqueness. 
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First, notice that any "affine" transformation ^>[a(b),S] described by l|3.35[l (where a S S)p.A(C), with the 
norms |b|i, P ' and |5|i iP ' small enough, say O(fi)) will act in such a way that H o £ <B K ', P '(T*C) for 
H £ *B K . P (T*C), despite the unboundedness as s — > — oo. This follows from the definitions of the spaces 
<B P (C), £>p,a(C), 2$k,p(T*C) in Section I5TT1 and can be verified directly. In particular, if S € *Bp'(C) and has 
bounded partial derivatives, the quantity is bounded. Thus a substitution (ft.,/) — > (ft,/) + into 

H3.29|) will not affect its structure as a Taylor series in p = (^y, I)- 

Also note that o <3>[a(6),0] = h ^ 1+ ^ s s ) ' v ^ , where v £ 25 P '(C) and is 0(/i). This is equivalent to an 

earlier made statement that da^ 1 g o a £ 58jj7 (C) for g £ Q3p '^(C). Combining it with the fact that by 
<|3.I7|) a single Taylor coefficient, member of 23 P (C) in the Hamiltonian will only change by O(fj) as a result 
of the action of the transformation a, one can see that H o £ <8 K ' !P < (T*C) if fi is small enough. 



Homological equation 



The quantities (a, S") are to eliminate the perturbation V to the leading order. In order to do so they should 
solve approximately the "homological equation" : 



H x0 *{a,S) - H x + V - c - Cl h = 2 (p) + 0(/x 2 ), 



(3.44) 



which is only possible for some specific values of the constants Cq, a to be found. Writing (|3.44l) out in essence 
requires only a direct substitution of (|3.35|) into (|3.29|) followed by an estimate for the "remainder" 0(/i 2 ). 
In order to get the equation for S, it is enough to plug p — > p + dS = p + (0, £) + dS into the Hamiltonian 
<|3~27?jl assuming S = 0(n). 

Furthermore if a = ao(^o) + b as in l|3.18|l . with b = (bo,b) = 0(/i), a calculation using (|3.1()|) and 
da^ 1 = (id + da' 1 db)^ 1 (da )^ 1 yields 



da 
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' A " 




' A " 
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' (-A + DJ&o 




LO 




LO 


X 






0(\b\l) 



(3.45) 



where the function ?7i(s) G S P (C) is determined solely by ip. As the constant vector (A,w) formally belongs 
to Q3p ' (C), the result of having multiplied it from the left by daT 1 is a vector-function from QSt' 1 (C), 
whose leading order is given above. 

It is convenient to think of b € «B^ 1} (C) = *B { p 7 A \C) as an element of the latter space, formally writing 

^ + b 

x 

B 



(3.46) 



{-X + D u )b 




D X ^B 



In other words, in order to do the estimates throughout the rest 
db, although s — > s + My would not be a legitimate transformation 



as the latter expression appears in 
of the proof, one can set da -1 = id 
for the s- variable, defined by Ij3.15|l . 

Then one ends up having a pair of first order linear PDEs: one for S and one for b. The first PDE is 
amenable to ProDOsition lA.fi the second one to ProDOsitions lA.il and IA.21 together, alias Proposition lA.4l It 
requires the appropriate choice of constants cq and Ci respectively, as well as the one-form dS cohomology 
class representative £ £ E™ in (|3.35(l . The latter quantity £ is chosen to ensure that the right hand side in 
the equation for B has a zero average in the sense of (|3.13f) enabling the choice of the constant c £ R™ +1 in 
Proposition [^3] simply as c = (ci,0) (owing to the non-degeneracy assumption on the quadratic part of H\, 
see Definition [2J • Thus H3.44J1 is equivalent to the following system of equations 
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CO = (/> + <£,">, 

^A.c^ - (/>-/, 

(G) ^ = 0, 

D x , u b = g 1 + D 2 H{p,q)\ p=0 £ 



(3.47) 



where 

I = (0,0, fli =9 + ^p P ^(p,q)l P =od5; g 1 = {gi,G 1 ). (3.48) 

Following Kolmogorov jT2] one starts solving H3.47|l with the equation for S, then finds £ to satisfy the 
penultimate one, then solves the last equation, the constants co,i being determined along the way. 

The equations for the quantities S, b are clearly amenable to Propositions IA.1I I A. 41 The norm of the 
solution, according to these propositions, is simply estimated by times the norm of the right-hand side. 
Applying the propositions results in analyticity loss. As a matter of fact, one encounters the analyticity loss 
six times along the way: solving the equation for S, evaluating the derivatives, solving the equation for 6, 
evaluating the derivatives, making sure that not only a £ S p .a(C), but the G 1 estimate for the norm of b 
is valid throughout the maximum range of a, and finally inverting it. Hence, strictly speaking one should 
introduce five intermediate spaces between say 2$ p (C) and 23 P ' (C), and the parameters 5, A should be scaled by 
factor 6. These standard steps are bypassed, and all the estimates for S and b, as well as their derivatives, no 
matter that they may also be valid in some intermediate (smaller) spaces, are all written in the target spaces 
<3V(C) and ^pV^C) S SBff' 1} (C) right away. The scaling of the analyticity loss parameters is absorbed into 
the constant C in (|3.3(ill3~37l) . The estimates follow from Propositions IA.1IA. 41 Definition^ and the bounds 
<|3.30|) . Here are the details. 

First, by Proposition lA. ll and the Cauchy inequality, S and its partial derivatives are in 23 P <(C), with the 
estimates 11 

\S\ P > < \dS\ p > < K^r 1 - (3-49) 

Then the quantity g x in l|3.48|l belongs to the space Q3p, (C) (in fact, any intermediate space between 
SpV^C) and <8{f 4) (C)) with the estimate 

Iffllp' ^Wl!/- 1 ], 

So the following expression is well defined: 

£ = -(D 2 II H(p,q)\ p=0 )- 1 (G 1 ). 

In order to estimate it, note that £ 6 R n depends only on the right-hand side as a real function of q. Thus, 

|£| < ^(Mpnf^AJ^ + i/- 1 ), \dS\ 9 . < (3-50) 

where in the first estimate the bounding constant depends on (a, p); this is not the case in the second, rougher 
estimate, see (|3.32|) for the formula for the quantity rj. This gives the first three of the estimates (|3.37|l . as 
well as 

|c | < /iMiT 1 (M[inf( 7 , A)]" 1 + v~ x ) < ^wlt?" 1 , (3.51) 

the bounding constant in the first estimate depending on (tr, /?), but not in the second estimate. Being finite, 
the constant Co is further dropped. 

11 One can go slightly more subtle estimating the derivative D S S, as it does not depend on So, where S = So + Si in the sense 
of the decomposition 13.121 . The norm of Si however, is estimated by Proposition IA. ll without any small divisors as /xA -1 . 
Then one can take a minimum of the following two estimates. One is to apply the Cauchy inequality, acquiring a factor A -1 . 
The other is to deduce from the equation itself that \D S S\ < A -1 (|<j| \D V S\ +At). The same thing can be done further estimating 
the C^-norm of b. So a C 1 -estimate for the solution of the equation = v on C can be obtained by dividing the norm of v 

by inf[<j<5, Asup(A, A | oj| 1 <5)] rather than A, causing a straightforward modification of 13.371 and 13.341 . 
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The upper bound on the value of c\ is then obtained from the last equation in (|3.47|) after using Proposition 
I A. 41 with the norm of the right hand side in <Bp, (C) bounded by fii]^ 1 : 

|d| < m - x . (3.52) 

Then the C° norm of b (as an element of 'Bj/^C) or of its representation l|3.46[) as an element of 58 ,1} (C)) 

is bounded in terms of /i(ry^) -1 and the C 1 norm - in terms of /i(r^A) _1 , which also ensures \b\p> < A, i.e. 
a, G 3?p,a(C), as well as its inverse if one takes a big enough constant in (|3.37|) . 

Finally, the presence of the additional multiplier (n — k'), in the smallness condition 1|3.36[1 is to ensure 
that the image of B™, +1 in (non-canonical) momenta p = (^y,/) under the map ^(a,S) is contained in 

B™, +1 , which is tantamount to requiring |6|i. P ' < k — k' . The smallness condition H3.36J1 by itself guarantees 
that |ci| < A. 

Analysis of transformed Hamiltonian 

It remains to estimate the term 0{^ 2 ) in (|3.44|l . Let us assume the smallness condition l|3.36[l for \i and use 
the above obtained bounds for S and b. Let C be large enough, say 100(n + 1) times the bounding constant 
for all the inequalities in the preceding section. 

1. From H3.45fl it's easy to see that id + db is essentially a "stretch factor" for non-canonical momenta p. 
Thus for the growth and non-degeneracy parameters M' , R! of the Taylor series Hoty g 58 K <. P '(T*C) one 
has M~ X \M' - M\, R~ 1 \R' - R\ < |6|i,p'. Together with (|3.52|) it gives the corresponding estimates 
of (|3.37(l . The contribution from the "shift" by dS in (|3.35|l is negligible, as the non-degeneracy 
assumptions in Definition 13 . 281 are global in p £ B™ +1 . 

2. The "new" perturbative momentum-zero-order term /' is formed by several contributions. The first 
one comes from the momentum-super- linear part of H and is bounded by M^SIf p/ . The contribution 

from the linear terms is bounded by |ci||i?|i,p/ + |g|p|<5|i,p'- Note that as D S S G !8i,(C), the "new" 
momentum-independent term /' is in 2$ P ' (C) indeed. The final contribution is / o a — /, with a bound 
jtt|6|i,p', by (|3~T7I) . Combining it with (|33U|) and $£52$ yields 

l/V < c^S- 2 - 

3. The "new" perturbative term g' in the first order in the momentum has a component, coming from the 
momentum-super-linear part of H, bounded by M|b|i ! p'|S'|i ! p'. Another contribution comes from the 
acquired term cih; its norm can be bounded by |ei||b|i )P >. Finally, the remainder da~ 1 (goa — g) has to 
be taken into account, with the bound /ii/ _1 |6|i. p /, by l|3.17|l . The first contribution clearly dominates 
the second one, and one can write 

IffV < ^^[Mr)- 2 ^)- 1 + p- 1 V - 1 ^A)- 1 } ^CV^'suplAf^A)- 1 ,^ 1 ]. 

The last pair formulas complete the set of estimates Ij3.37|l and the proof. The final remark to make 
here is that formally setting daT 1 = id — db with b G S^, (C) = 53p?' 1 ''(C) does not bring in extra 
error. E.g. it can be taken precisely for the sought quantity, to which after b has been determined one 
can unambiguously match a transformation a G £> Pj a(C) as long as |b| p < is small enough. □ 

4 Splitting problem 

This section contains the principal part of the proof of Theorem ^ The theorem follows from the analytic 
splitting theory in T*(C co ) 1 developed further on the basis of the main results of the previous section. 
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4.1 Preliminaries 



Energy-time coordinates 

Let us start out with the necessary additions to the set-up in the beginning of Section I5~T1 One still uses 
formulae 1)3.11 13.5]) for the energy-time coordinates (h,s). However there is extra structure underlying the 
splitting problem. Namely, suppose the function ip{x) introduced in Section 1X11 and determining the trans- 
formation s is 27r-antiperiodic (as it is in l|2.11[l due to reversibility of the truncated normal form l|2.7|l ). Then 
ip(x) is 47r-periodic, and the domain T> for the £- variable is a complex extension of T' = R/47rZ. Technically, 
assume that it contains a pair of balls of radius r € (ry,, 2r^) centered as x — and x — 2ir and that outside 
these balls in V one has IV'Oe)! > r /2- In addition, without loss of generality one can assume that 

where P.V. indicates that the integral is taken in the principal value sense 12 . 

Rewrite the 27r-antiperiodicity property (|2.12|) as ° ip = t, o %p, where i : s — > — s is sign inversion. 
Extend the latter to a diffeomorphism i : {s,ip) — > (— s, (f). As the image of the map s defined by l|3.1|l 
acting on T> \ {0, 27r}, one can simply consider an imaginary circle C/2i7rZ, as all the functions of s we are 
dealing with here are 2i7r-periodic. Otherwise a branch of s can be fixed by drawing a branch cut in T> as 
a "semicircle" in T', connecting the points x = and 2ir, but not containing x — ir. The inverse map x is 
represented by a homonymous function of a complex variable s, which is 2i7r-periodic and analytic in the 
bi-infinite bi-strip At, p U , see (|3.3f) . For now, 47r-periodicity of the function ip does not allow one to 
distinguish the values of x(s) modulo An. The half-width p and the parameter 02 can be still defined by (|3.7I 
13. 8J) with the choice of, say T = T^,> — 21ogr^, i.e. the level set 3?s = T will be contained inside the ball of 
radius 2r^ centered at x = 2tt. In other words, 

p = sup{£ > : both curves 3s (x) — ±£ lie in T>, connecting the points x = and x = 2tt}, 

while 1T2 = inf sup \^sx\ over the above pair of curves. In this case one should definitely have 

P<~, (4-2) 

for a bounded T>. One reason, for instance is that the level curves 3?s = =fT for T > are contained 
inside balls of radius 2r^ centered at x = and x = 2ir respectively and are not homotopic in T> \ {0, 27r}. 
Furthermore, in terms of the maps s, x (|3.1|l the 27r-antiperiodicity of the function ip(x) combined with 14.11) 
result in: 

S O Z 2 tt = Un L S, X O L — ?27r x o ^ i7r , (4-3) 

where li„ : s — > s+iir (also defining a diffeomorphism 1^ : (s, ip) — > (s+iir, tp) and a canonical transformation 
Lin acting on the momenta as the identity) and l±i- K arc identified on C/2i7rZ as well as l±2n on T'. 

Fig. 3 provides an illustration, and the antiperiodicity property is expressed there by the fact that the area 
where € [— 27r,0) is congruent to the area where $lx £ [0, 27r) flipped about the real axis and translated 
left by 2tt. Then as far as the definition (|3.7() of the quantity p, relevant to the pair {tp,T>) is concerned, by 
continuity there must exist a level curve 7^ of $ss(x), emanating from x = with |£| < n/2 which will exit 
a bounded domain T> before (ever) arriving to the point x — 2tt. This necessitates the existence of singular 
points of the functions x(s) off the real axis for |3?s| < T^, p < |3s| < |, inside the unshaded rectangular 
regions in Fig. 3, as it is the case with the classical pendulum, where the singular points are s = ±i5- 
Dealing with the classical pendulum where ip(x) = 2sin(x/2), the semi-width of T> can be taken arbitrarily 
large which means p = p(ip, T>) approaches -| from below 13 . For the pendulum ip(x) is also odd, so one should 
have D x tp(n) = D s x(0) — and x( s ) — 2sechs, an even function of s. 

12 Otherwise the lower limit of integration w in the defining formula 13.11 should be substituted by some a S (0, 2ir) and the 
integral in 14.11 shall be taken from a — 2tt to a. Such an a always exists by continuity, 2-7r-antiperiodicity and positivity on 
(0,2tt) of tp{x). 

13 As p approaches ^ the functions from the space Q3p(C), etc. naturally grow unbounded. For optimal splitting estimates one 
wants to have p as large as possible, which results in various technical nuances in the literature, see 1141 . IIUI etc. 
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Figure 3: The map x. Branch cuts on the right image are unnecessary if one docs not distinguish the values 
of x modulo 47r. The cuts along the horizontals 3 s = ±5 are the (coinciding) images of the vertical dotted 
diameters of the small circles centered at x = —2ir, 2ir. The cut on the right image, where 3s = ? [3s = — ~] 
corresponds to the dotted radius positioned above [below] the real axis on the upper left image. The lower 
left image illustrates how various branches of x(s) as a multi-valued function can be constructed. 



However, further analysis of Hamiltonian H2.21ll3~2"4*)l . see also (|2.22(l . does require the ability to distinguish 
the values of x modulo 4ir (unless 8 £ Z"). In order to do so one can treat the function x(s) whose domain is 
shown in the upper left image in Fig. 3 as a multi-valued, bi-real-analytic, 2i7r-periodic function with values 
in C. Its single branch labelled by j G Z will be defined by fixing x(0) = 4atj. This is equivalent to taking 
I? as a bi-infinite strip about the real axis, drawing branch cuts at all the translations of [— 2ir, 0] by 4irj 
and taking the lower limit of integration in (|3.1|l equal to w + iwj, see the lower left image in Fig. 3. As 
a multi-valued map, x(s) has branch points inside the unshaded rectangular regions in Fig. 3. There are 
branch cuts emanating from a corner of each shaded region, whose exact appearance depends on how exactly 
the covering of T> (identification of x S C modulo Air) is defined. See the caption to Fig. 3. 

Application to Hg 

In this case the pair (-0, T>) is well defined, see the end of Section 3.1. Recall that the Hamiltonian Hg (|2.21|) 
is viewed as a multi- valued function on T*(T' xT"), where in view l|2.22|l a branch is identified by a (real) 
value of Hg at (y,I,x,(p) = (0,0,0,0). The application of the transformation S s to a chosen branch of Hg 
results in the "new" Hamiltonian H s o S s = H s (h,I,s,ip) 6 25 K , P (T*C), with p = (r, oo,p, a) given by l|3.24() . 
Application to any other branch of Hg is tantamount to the shift of the angles ip by a multiple of AttQ and 
does not require extra consideration. As the values of the variable x cannot be identified modulo 4ir (|2.22(l 
unless 9 E Z™, as far as the domain for the variable s of H s , is concerned it should contain branch cuts, e.g. 
as shown in the right image in Fig. 3. The splitting problem for H s can be briefly described as follows. 

In the absence of perturbation, the bi-infinite bi-cylinder Coo is an invariant manifold W t which is the unstable 
manifold to an invariant torus 7o jt at s = — oo, sec Figs 1,3. As s — > oo along the two different lines 3s = 
and 3s = 7r, one arrives into a pair of different tori 7^ jt {[3 G { + , — }, the + sign corresponding to the former 
line; compare with Remark 3.7) corresponding to x — ±2tt respectively in Fig. 1. W t is a part of the stable 
manifold for these tori, which can be analytically continued further by choosing one of them and then flipping 
the branch cuts in Fig. 3 with respect to the imaginary axis, whereupon the chosen branch (above or below 
the branch cut along the line 3s = § in Fig. 3) can be analytically continued into the strip, whereof it was 
separated by a branch cut. And so on. 

The perturbed situation, with respect to Figs 1, 3 is qualitatively as follows. Let H be the restriction 
of H s on T*Ct, for T < oo. Theorem |2 stipulates the existence of the perturbed manifold Wq, defined as a 
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graph over the semi-infinite bi-cylinder Ct in terms of a generating function Sq. Wq contains an invariant 
torus 7o near (in the sense of Remark 3.2) {s = — oo}, for which it is an unstable manifold. Furthermore, in 
order to apply Theorem [5] (twice) on the bi-cylinder going into {s = +00}, one should use the analytical 
continuation of H s (-,s) in s from either the strip |3s| < p or the strip \Qs — tt | < p. Such an analytic 
continuation is roughly tantamount to flipping the branch cuts in Fig. 3 with respect to the imaginary axis. 
Denote these analytic continuations as Hp for j3 = +, — respectively. By (|2.22[) these analytic continuations 
should in particular differ from one another by the 47r(9-shift of the ip- variables and can be both described 
using the sputnik Hamiltonian H = Hq o L~ 2 to Ho, the set-up of Corollarv l2. 41 being guaranteed by l|2.22|) . 
An application of Theorem|2]to each Hamiltonian of the pair Hp results in a manifold Wp defined as a graph 
over the semi-infinite bi-cylinder Ct via a generating function Sp and containing an invariant torus Tp near 
{s = +00} (i.e. near x — —2ir, 2ir for He, with (3 = signs) where Wp is the stable manifold. 

The Hamiltonians Hq and Hp coincide for s in the bounded strip around R and R + respectively for 
f3 = +, — . This enables one to define the splitting function on a finite bi-cylinder C as So — Sp for s in the 
corresponding strip. An important issue that the cohomology classes of the one-forms dSo,p are all equal to 
one another follows easily from H2.22|l and Corollarv l2.4l 

Technically, let us start out by calling Hp(-,s) the 2i7r-periodic restriction of H s (-,s) into the complex one- 
strips A^ j00iP described in Remark 3.7. Instead of dealing with Hp near {s = +00}, consider the Hamiltonians 
Hp o 3p, where 

(-h, I,-s,ip), P = +, 

(—h, I, -s + 2in, ip), (3 = —. 



Jp : {h,I,s,ip) 



Addition of 2in to — s in the second line is optional inside the functional dependencies, as all the functions 
of s involved are 2i7r-periodic. It has been done above simply to make sure that the restriction bp of 3p to 
the base space maps the strip |3s — tt\ < p into itself; it uses 2i7r-periodicity of H s (-, s) in s. Then combining 
lt2~2^jl with (|3~5|) one gets 

Hpo3p = H_p o L- 2 o o L;f ee H_p o L m<p , (4.4) 

the transformation L x having been defined earlier by l|3.25|l . Namely (the correction below [+2i7r] not 
appearing for f3 = +) 

Hp(~h, I, -s[+2wr], <p) = H^p(h - 2\ x 2 {s - 0iir),I, s - pin, p + /32tt0). (4.5) 

Indeed, the last two formulas follow from l|4.3|l regarding the presence of branch cuts for the map x(s) by 
simply matching (3 with signs, for real x. Namely, in the second formula in (|4.3|) the shifts 1±2-k and 
have been identified. To make a choice of the sign for the formula l|4.4|l all one has to do is to act by l-2n 
on 2; G (0, 2ir) and by ^ on x G (— 27r, 0); in the same fashion l-m acts on s : 3s = i, corresponding to 
x G (— 2tt, 0), and acts on real s corresponding to x G (0, 2ir). 

The two functions Hp(-, s) o3p allow analytic continuation in s into the one-strip A^^.p, see Remark 3.7, 
where they are characterized by the same array of non-degeneracy and smallness parameters (in particular 
v = 1). The pair of transformations £* l( g defined in (|4.4|) plays the role of sputnik transformations, as they 
obviously satisfy Assumption [5] On the other hand, the Hamiltonians Hp(-,s) represent the same analytic 
function Hq(-, s). The rest of the development is clear: Theorem|21and Corollary 12 . 41 are satisfied by Hq and 
a pair of its sputniks Hp o 3 p. We proceed with some generalization. 



4.2 Splitting theory on Coo 

Let (3 £ Z 2 = {+, — }, K 3> 1, P = (r, 00, p, a). Consider a Hamiltonian H = Ho G 25 K , P (T*C) in the 
cotangent bundle T*Coo of the bi-infinite bi-cylinder Coo, in the form (|3.29|l . Let Hp(-, s) be the restrictions 
of the function Ho{-, s) into bi-infinite strips |3s| < p and |3s — 7r| < p for /? = +, — respectively. Suppose, the 
Hamiltonians Hp o Jp = —\h + (uj, /) + ... allow analytic continuation into one-strips s G A^ j00iP introduced 
by Remark 3.7, namely into the region {5Rs < —2T^}, 2i7r-periodically. For the above analytic continuations 
let us still use the notations Hp o Jp, the latter quantities being well defined in the cotangent bundle over 
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the one-cylinders Cp iV . Suppose, the restrictions of the quantities Ho{-, s) and Hp o3p{-, s) over -fts < T < oo 
satisfy the conditions of Theorem[21 with the same analyticity parameters k, r, T, p, a and equivalent (footnote 
|5J) non-degeneracy and smallness parameters A, R, M, p, v. 

In particular this means that for p = the by-infinite bi-cylinder would be an invariant Lagrangian 
manifold for H Q , asymptotic to a torus T at s — — oo and a pair of tori Tp at s = +00, see Fig. 3. In 
addition, assume the following. 

Assumption 3 (Sputniks on Coo) There exists a pair of canonical transformations 

L., fi = L.A"f»S;): [I I ^y^ +dsm (4-6) 

where a*p are diffeomorphisms of Coo, such that a*^ — id G p3 p (C)]™ +1 ; as well as Sp £ Q5p(C), such that 

Hp o 3p — H_p o L* |( g. 



Then one has the following lemma. 

Lemma 4.1 Let i £ {0,/?}. There exist invariant Lagrangian manifolds 



Wo = {(h, I; s, if) G X C : h = D s S (s, I = D v S (s, y>)}, 

W), = {(M;^)eR" +1 xC p -; ^D^f^J./^^t^)}, 1 " ' 

contained in the level set ff _1 (co) for some cq satisfying 13.51}) . One has So £ 58 p /(C), Spou G SSp'^^g), t/iey 
satisfy the corresponding bounds of \3.34\j and [dS\\ = £ G R™, Vi. TTie manifolds W\ contain invariant tori 
7[ (whereupon the flow is conjugate to a rotation with the frequency u>) near s = —00 for i = and s = +00 
for i = /3. 

Proof : The lemma is an immediate consequence of Theorem [2] and Corollary 12.41 see also Remark 3.7 
following the latter. Indeed, Assumption [21 implies that Assumption is satisfied in the sense that Hp o 3p 
is a sputnik of Ho (restricted as a function of s from the bi-strip At, p to a one-strip A—p t x,p to yield H_p) 
under the sputnik transformation L*_p.O 

The above lemma is central for the splitting problem, for now one can introduce the splitting distance as 
an exact one-form on the bounded bi-cylinder Ct> — (I Cp.T' by defining it separately on each of the 

/3e{+,-} 

above components (corresponding to f3 = signx as far as the original simple resonance splitting problem is 
concerned) : 

d6(s,<p)=d[So(s,<p)-S (s,<p)], (s,<p) G h-p,T', P > x (4.8) 

Let us call the function 6 € 25 P '(C) the splitting potential. Note that the (complexified) domains Crt 1 ,p',o' 
are disjoint for different (3. In terms of the notations introduced in (|3.3I I3.4|l and Remark 3.7, one has 
A +i t jP = ILr iP , A_.x !P = TLt.p + i, where Ht,p is simply a symmetric rectangle in C with the half-length T 
and half- width p <\. 

To make things easier, let us simply view S(s, tp) as a double- valued function, bounded and real-analytic 
for (s, <p) £ n<7" p' x T™, by changing s — > s + iir for s G A- t T',p' (the strip about the line 9s = it). Hence the 
rest of the statements and estimates will be valid for either one of the two branches of 6 over the domain 
II x T", the index [3 being mostly omitted. The functions one is dealing with are members of the space 
®T' p' cr'(H x T"), whose element u can be represented as a uniformly convergent Fourier series in ip with 
coefficients Uk(s), k G Z n , bounded and holomorphic for s £ Ely p/, with u_fe(s) = ut(s) = Uk(s*) (* marking 
the complex conjugate). Thus in the sequel parameter vectors p will have three components (T,p,a), and 
as usual p' < p and A = \p — p'\. Besides the use of a certain finite number of intermediate values p" of the 
analyticity parameters such that p' < p" < p is implied by default along the way. 
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Lemma 4.2 The splitting potential 6 satisfies a homogeneous quasi-linear PDE in ILpi p i x T™// 

([A + X„(s, <p)]D s + (lu + u^s, ip), D v ))& = 0, (4.9) 

where the pair g 

= (A^^v) G [ ( 8 T ', p ', (T '(n x T n )]" +1 satisfies the bounds 

|(A„> w /*)Ip' <C m -\ (4.10) 
where rj is defined by fc/.yty) and C is of the same order as in Theorem^ 

Proof : Follows by Corollary 12. II each single S is a solution of the Hamilton- Jacobi equation for H on the 
energy level Co, thus i|4.9|) is obtained by subtracting the equation for the former function from the same 
equation for the latter one. The estimate 14.10JI follows from Definition [3 and the bound for g in Q3.3()|l as 
well as the bound (|3.34|l for the norm of dS (which also turns out to be the estimate for M\dS\ , see (|3.49I3.52|) 
for detail. As the result one may have to multiply the constant C in Theorem I A . 1 1 by a factor, depending on 
n and r only. □ 

A prototype of the following lemma is due to Eliasson 

Lemma 4.3 A branch of the function & for (s,(p) G II x T n has at least n + 1 critical points tp c = (p c (s), 
given s, i.e. where D v &(s, (p c ) = 0. 

Proof : This statement is a consequence of the fact that given s € II, the function S(s, <p) is 27r-periodic in 
each component of <p, by Lemma 14. II essentially stating that the one-form d& is exact. The number n+ 1 of 
critical points is the Ljusternik-Schnirclmann characteristic of the torus T". □ 

The statement of the next lemma is similar to Corollarv l2.2l claiming the structural stability of the constant 
vector field Xq = Aj| + (u>, -g-) on the bounded one-cylinder II x T" under small perturbations. It is crucial 
for the exponentially small estimate (|2.19|l . The prototype of this result was proved by Sauzin Y67\ regarding 
the so-called characteristic vector field. 

Lemma 4.4 There exists a constant C — C(n,p) but independent of u>, such that for x = xo + g, with 
g G Q5 p (n x T") such that 

\g\ p < C- 2 \a, (4.11) 

there exists a diffeomorphism a = id + b with b G [55 p /(II x T™)]" +1 , effecting the conjugation 
da^ 1 o x o a = Xo, with 

\b\ iy < C\g\ p (\A)-\ (4.12) 

Proof : This lemma is yet another implicit function theorem regarding the operator D\ :U1 introduced in 
l|3.22|) . One can rewrite the conjugacy problem in question as 

go(id+6)-£> A , w 6 = 0. 

Or in Hamiltonian terms, one seeks a canonical transformation 4/ : q = a(q'), p = l (da) 1 p / , such that a 
linear Hamiltonian H(p, q) = Xh + (w, /) + (g{q),p) is conjugate to Xh + (uj, I). 

The vector field conjugacy problem is certainly amenable to an abstract implicit function theorem |4()j . 
[35j however the latter would not provide the optimal condition l|4.11J) . as well as for the conjugacy problem 
H3.21(l . In order to get (|4.11(l one should follow the standard iterative scheme mimicking the proof of Corollary 
12.21 ( which in turn is a particular case of the proof of Theorem|2J) basing it however on Proposition ! A. 5l rather 
than Proposition lA.il 

The latter proposition analyzes the possibility of finding a solution u to a PDE D\ ^u = uon li-r.p x T™, 
such that |u| p / can be bounded irrespective of lu. The kernel of the operator Dx ^ on 5B p (n x T n ) consists 
of all functions, which are represented by Fourier series in the variable <p = <p — js, and the norm of such a 
function in TIt,p x T™ clearly does depend on uj. To avoid it one is naturally led to solving a Cauchy problem 



set up by conditions (|A.2I IA.3|) in Appendix A, as a way to determine the required inverse of the operator 

A single application of Proposition lA~5l furnishes the approximate (first order) solution 61, satisfving l4.12l 
whereupon the perturbation g changes to g 1 , the norm of g 1 bounded by |<7| p |&i|i.p'. This fact constitutes the 
analogue of the Iterative lemma l3~Tl whereupon the standard dyadic iterative procedure (see e.g. Appendix 
B) is run. □ 

Results of the type of Lemmas 14. 214. 41 as far as exponentially small splitting is concerned have been a target 
of a number of works of Lazutkin starting from ^H] and followers, see e.g. ^U]. Indeed from Lemmas 14.214.41 
one can further easily deduce the following upper bound for the infinity norm (the supremum over the real 
values of the variables only) for each branch of the splitting function &. 

Theorem 3 Suppose, the assumptions of Section \4.1^\ are satisfied and the smallness condition holds, 
with a large enough C = C(n,T,ip,p). Then 

\&U < C m - 1 ]T e X p(-\(k,j)\p'-\k\a'), (4.13) 

fcGZ»\{0} 

where n is defined by $3.3Hfy . p' = p — A < n/2, a' = a — 5. 

Proof: The estimate is clearly the same for each branch of the double- valued function 6 on LT X T™. 
Consider one branch. Let a be the conjugating diffeomorphism of Lemma |4. 41 (the estimates of the lemma 
are uniform in (3) . Then the function & = 6 o a is constant along the flow lines of the constant vector field 
Xq on the bounded one-cylinder II x T™. Then it is a real- analytic function on T": one can formally write 
& = &(ip — js) = &(4>), where <fi — (p — js and expand it into the Fourier series 

S'(s,</3)= 6' k e- t( ~ k ^ s e t{k ^. (4.14) 

feez™\{0} 

Then 6'(s, (p) is a quasi-periodic function of s E ILr'y. Besides the standard complex analysis technique for 
estimating the Fourier coefficients yields (once again scaling the analyticity loss parameters by, say factor 4) 



< e-l fc l CT '|e| p ,, Vsen TV . (4.15) 



This implies, as \Qs\ < p' that 



\&' k \ < |6| p ,exp(H^~)|p'-|fc|c/) 



and consequently (|4.13|l as one can use for |6| p /, while |S|oo X \&'\oo < }^ \6' k \. □ 

fcez™\{o} 



Conclusion of the proof of Theorem 111 

The splitting problem for the Hamiltonian H s given by 13.24f) satisfies the conditions of Theorems |21 and |21 
with lu = ui = -jM=*. R = 1 and M = M /R . 

V g-n -0 

Assuming (|2.8|) and letting p = y^A 2 (A/o/ V^o) inf (^o<5o ; Rq), v = 1, it is easy to check that for the 
application of Theorem|21(with n > 1, well-defined quantities r^,T^, as well as p defined by (|3.7I) . cr = o"i + i-<5o 
and A < 5 ) one has ry _1 < sup(M Rq 1 y/eRo(X<;oSo)~ 1 , 1) < A -1 . Thus (|2.8() with the proper choice of the 
bounding constant, depending on (n, r, -0, T>, A, cr) ensures the applicability of Theorem|3 

Besides, the sputnik property, in order to satisfy Assumption |21 has come a long way: 1)2.161 l2~2*2*l I4.4|l . 
To establish the fact that all the three tori %^ claimed by Lemma f4 . 1 1 correspond to the same torus T (see 
Fig. 1) for the Hamiltonian (|2.6|l one should chase back through relations 14.41 12.221 12.16|) and notice that 
the manifolds Wp arise from the sputnik manifold Wq to Wo, described by Corollary 12.41 with the sputnik 
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transformation = L~ 2 , alias L^ 2 , solely via a translation x — » a; — /327r in terms of the variables (x, </?) of 
Hamiltonian (|2.14|) . 

Finally, we show how the transformation (|2.20|> with the underlying base space transformation ag : 
(x, <f) — > (x, ip + 6x) affects the estimate (|4.13|) . Suppose 6 S is the splitting potential, defined according to 
14.811 for the Hamiltonian H = H s . Theorem [21 followed by Lemmas 14 . 214 . 41 imply that the Hamilton- Jacobi 
equation for the quantity — & s o s _1 o a^ 1 in the variables (x, ip) of Hamiltonian l|2.14|l is conjugate to 

(Xtp(x)D x + (ui + X6ip{x), D v ))6'^(x, ^ = 0, 

by a near-identity change a of variables [x, ip) (where = 6^ o a) with the total analyticity loss of the 
order of 5q. Therefore instead of (|4.14() one has 

&{s,tp)= & k e~ l{k ^ s+ex ^e l{k ^. 

fc6Z"\{0} 

where & = &L o s. Then with p' = p — A and a' 2 — <?2 — A, by mimicking 14.15JI one gets the estimate 

\& k \ < |6^| > , e xp(-K* > ^w 1 + / a fl>|-|% 1 ) 1 

which completes the proof of Theorem ^ upon removing primes for the pair (p,a 2 ) in the formulation of 
the theorem. Note that the lower bound 2n + 2 for the number of homiclines comes from the application of 
Lemma T4. 31 to the two (upper, lower) separatrix branches, see Fig. 1. □ 

5 Appendices 

A First order linear PDEs on bi-cylinders 

The following set of proposition addresses the issue of the existence of the right inverse for the operator D\ M 
defined by (|3~2l)l . with u G 2U; l , 7 and A > 0. 

Proposition A.l Let p = (r,T,p,o~) and v £ 2J P (C). There exists a real c, |c| < |i»| p , such that the solution 
of the equation D\ yU} u — v — c exists in Q3 P '(C) for p' = (r, T, p, a 1 ) with < a 1 < a. Let a — a' = 5 and 
<; = inf (7<5 T . A) . Then 

Me' < <r>| P . 

Proof : Following (|3.12|1 . let v = vq+vi, where vq £ <B CT (T™) and v\ £ Q5i(C). Seek u(s, <p) = uo(<p) + ui(s, ip), 
such that D^uq = vq — c and D\^u\ = v\. The first equation obeys Proposition I2.3.T1 if one chooses 
c = (vq). The second one is solved in C p by the method of characteristics, regarding the fact that one can 
write Vi(s, tp) — x(s)w(s, ip), for some w £ Q3 P (C): 

Ui(s,ip) — j°^ oD Vi(s + Xt,ip + ujt)dt = A -1 f®^ x(s + t)w(s + t, ip + ujX~ 1 t)dt 

= ^SloowxttMCv-ui-H'-CM (A.i) 

= A -1 J* w[s(x), ip — wA _1 (s — s(x))]dx 

The latter integral was obtained via the substitution £ = s(x), see l|3.1[l the integration fulfilled along the 
level curve 3s(x) = 3s, see Fig. 2. The latter integral is bounded by a {tp, 2?)-depending constant times the 
norm of V\, see <|3.11ll . Thus u\ £ 2$i(C), with the norm \u\\p < A _1 |u| p . Bi-real analyticity of u follows by 
construction. □ 
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Proposition A. 2 Given v <E 25 CT (T") the solution of the equation (—A + D u )u — v exists in *B a (T n ) and 

Proof : Clearly u(<p) can be found explicitly as a Fourier series in tp, whose coefficients Uk are expressed via 

the Fourier coefficients Vk of vUp) as follows: Uk — — : — r, k € Z n . If = w£ (complex conjugate), 

—\ + i(k,uj) 

this property i.e. real analyticity is clearly retained by u. □ 

Proposition A. 3 Let p — (r,T,p,a) and v £ *Bp(C). There exists a real constant c, \c\ < \v\p, such that 
the solution of the equation Dx^u = v — c exists in 25 ~, (C) /or p' = (r,T,p,a') with < a' < a. With the 
same 6 and <j as in Provosition \A.ll one has 

Me' < 

Proof : By Definition of the space 2$p~ (C), t> admits a unique decomposition v(s, = + ^i(s, </?), where 

Mo(y) 
X(s) 



u e Q5 CT (T") and w x e 2* P (C). Seek the solution u(s, p) = -^f- + u x (s, p). Then 



- -A h Aj uo + D\,uUi = h vi - c. 

X V X / X 



of the equation (—A + -D w )«o ~ v o, which exists by Proposition IA.2I while U\ should satisfy 



As — = 1 + x{s)m{s), see <|3~T51 EH5t where the function r/i(s) € 58 P (C), Mo can be taken as the solution 



D\ >u Ui =Vl + \T)xUq - c. 

The first two terms in the right hand side are members of 25 P (C), so Proposition lA.il does the job, with the 
constant c = + (vo}ipxx(0)- D 

Combining Propositions lA.ll and IA.31 one gets 

Proposition A. 4 Let t> £ 25p~' 1 ' ) (C). There exists a constant c e |c| < |u| p , s«c/i i/iai i/ie solution of 

the (n + l)-vector equation Dx ^u = v — c exists in 25 p , (C) for p' = (r, T, p, a') with < a' < a . With the 
same 6 and $ as in Provosition \A.ll one has 

Hp' < s->\p- 

Let v = (v, V). If for j = 1, . . . ,n, (Vj) — 0, then c — (c, 0), where c is the same as in Provosition \A . ,9t 

The proposition herein pertains to a conjugacy problem on the bounded one-cylinder II x T n . Let p = 
(T, p,a). A function v € 25 p (II x T" ) is given as a Fourier series 

v(s,p) = £ v k (s)e^\ 

where w_fc(s) = v^(s) = Vk(s*); * marks the complex conjugate. Given u> € M. n (not necessarily Diophantinc) 
let 

ZZ = {ke Z n : (k,u) > 0}. (A.2) 
For k £ and a fixed p' < p (let also p' = (V , p', a') < p, A = \p - p'\) denote 



u kM = -- 



VkiiQe-^dC (A.3) 



(i 



For k £ — Z", let Uk[v] = u*_ k [v], for k such that (k,oj) = 0, let Uk[v] = 0. 
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Proposition A. 5 Let A > 0. A Cauchy problem 

D XjU ,u = v, u(0,<p) = Mv]e l{k ' v) 

has a unique solution u S 5^(11 x T™) } with \u\pi < CA _1 |w| P; \du\ p/ < C(Aa) _1 |d|p where C may depend 
on n and p but is independent of u. 

Proof: Seeku(s,(^) = V" u fc (s)e*<*'^ . Then u k {s) satisfy 

fcGZ" 

*>u' k + i(k, uj)u k = v k , u k (0)=u k [v]. 

For k such that (k,oj) — let 

Uk(s) = - v k (t)dt. 
A Jo 

For fc <E let 

where the integral can be taken along the part of the imaginary axis until 3£ = 9s and then along the 
horizontal line. For k G — Z™ take the lower limit of integration as —ip'. Then the integrand is always 
bounded by sup senT |^fc(s)| in the absolute value. The initial conditions are satisfied: by real analyticity 
of v, one has V- k (—iQ — v k (i() for ( e K. In particular, the solution u is also real analytic. Note that for 
the elements u(s,ip) of 2} p (n x TP 1 ), represented by Fourier series in ip with analytic coefficients u k (s) the 
supremum norm in II^p x T™ is equivalent to the norm defined as 2, SU P l u fc( s )li the comparison constants 

feGZ™ J- ,p 

depending in particular on a. □ 

B Conclusion of the proof of Theorem [2] 

First note that the smallness condition H3.33|) has simply combined the smallness condition (|3.36|) on p for the 
Iterative lemma to be valid with the lemma's remainder estimate Ij3.37|l on pi , simply to ensure that p! < p. 
The rest to ensure that actually p! is many enough times smaller than p, so that the Iterative lemma can 
be applied again and again, with a smaller and smaller analyticity loss. This is achieved simply by choosing 
the constant C in l|3.33[) small enough. Without loss of generality (for this part of the proof) assume that 
k — k' = r — r' = p — p' = T — T' = a — a' = A = 5, as well as S < A and v > M <;S. 

Take a geometric sequence {Sj = 2~ J 5}j>i. Let Oj = a — Ya=i$1- Define sequences {k, r, T, p}j>i in 
the same way. Denote jj,- — (rj, Tj, pj, Oj). Identify the parameters n, r, T, p, a, A, R, M, v, p with themselves, 
endowed with zero indices. Let Co > 1 be the constant, whose existence is stated by Lemma 13. II 

If p satisfies the smallness condition (|3.33ll with some C > 2 2t+3 Cq, the assumption (|3.36|) of Lemma 
13.11 is satisfied for a single application of the lemma, with an analyticity loss Si = 2 _1 (5 and a parame- 
ter ft = 2~ r <; (playing the role of <r in lj3.32|l implying that r]i > 2^^ T+1 ^-q. This results in a coordinate 
change Si = Hi(ai,/Si). At the output, according to Ij3.37|) one will have the perturbation parameters 
v\ > 2~ {T+1) M- 1 5 T+1 and p x < 2 2t+2 C%C~ 2 p < 2~ 2t "V, as well as the new quantities A x , R u Mi, such 
that 

mi = sup (A^IAq - Ai|, R^lRi ~ Ro\, M^Mi - M |) < 2 2T+2 C a C~ 2 < ± 

o 

Now let C > 2 2t+5 Co, and assume that the Iterative lemma can be applied repeatedly for j > 2, with an input 
parameter set {Kj-i, Tj-i, pj-i, Tj-i, (Tj-i, Aj_i, Rj-i, Mj-i, pj-i, Vj-i} and an analyticity loss Sj, resulting 
in a transformation tyj — ^j(aj,Sj) and an output parameter set {Kj,rj,Tj,pj,aj,\j,Rj,Mj,pj,Vj}, such 
that 

Vj > (2M)- 1 2-^ r+1 )S T+1 , 
pj < 2-^ 2T +^p, 

rrij = supiXj^Wj-Xj-ilRjWRj-Rj-ilMr^Mj-Mj-i]) < ± 
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This would imply that the sequence {/J-ji / J 1 }j>i vanishes geometrically. 

Suppose, the above assumption is true for I = 1, — 1 (the case j = 1 has been checked). Then on 
the jth application of the lemma, one can let ^ = 2~ JT i;, hence r/j > 2 _ ^ r+1 )~ 2 ?7, because the inductive 
assumption implies that Aj_i > -|, Rj-i > , Mj-i < 2M. The condition l|3.3t)[) where each parameter 
involved has been endowed with an index j is satisfied; in fact, the right hand side of it majorates a vanishing 
geometric sequence with a ratio 2 2t+3 , whereas the left hand side is majorated by a vanishing geometric 
sequence with a ratio 2 2t+4 (by the induction assumption). Another application of the Iterative lemma 
yields 

Mj+i < 2( 2r + 2 )+ 5 C 2 C- 2 ^ < 2-( 2r+4 V,, 

The fact that rrij < | is easy to verify, similar to the case j = 1. This justifies having 2M in the above 
assumption about Vj and completes the prof of the induction assumption. 

Now the statement of Theorem [5] and its Parameter statement follow from chasing through the iterative 
scheme (|3.27|) and the estimates of the Parameter statement of Lemma 13.11 The existence of the limits 
a = oi o 02 o . . , and S = J2fLi Sj follows from the fast convergence of the estimates for their norms 

and completeness of the spaces QS P '(C), Sp?' 1 ^ (C) . Finally, the estimates (|3.34|) pretty much reproduce the 
corresponding estimates of the Iterative lemma. Indeed, due to the geometric convergence of the series 
Sj>i ^7 ano - the composition ai(6i) o 02(62) o . .., it suffices to estimate the norms of 5*i and b\ only. □ 
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